Post 7 (Question 8): Characteristics of big data analysis

Managing big data starts with characterising it, which organises our understanding of what we are dealing with. Big data is therefore defined by more than one characteristic. Three are commonly taken into account: Volume, Velocity and Variety. Volume refers to the size of the data, which is continuously growing, and raises the question of how much data there is. Velocity refers to the speed at which data is generated and processed, and raises the question of how quickly it must be handled. Variety refers to the different types of data, and raises the question of how each data format differs from the others.
The questions these characteristics raise help us make sense of big data. They also help us learn how to deal with massive and varied data at a manageable pace and within a reasonable time frame, so that the value of the data can be identified and analysed, and a response provided as swiftly as possible.
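
To make the three characteristics a little more concrete, here is a minimal Python sketch that summarises a small batch of records by volume, velocity and variety. Everything in it is an assumption made for illustration: the sample `records`, the helper names `payload_size`, `payload_kind` and `describe`, the choice to measure velocity as records arriving per second, and the very rough way of guessing a format. Real big data tooling works at a far larger scale, but the questions being asked are the same.

```python
from collections import Counter

# Hypothetical sample batch: (arrival time in seconds, payload) pairs.
records = [
    (0.0, '{"temp": 21.5}'),
    (0.5, "sensor-7,21.7"),
    (1.0, b"\x89PNG"),
    (2.0, '{"temp": 21.9}'),
]


def payload_size(payload):
    """Size of a payload in bytes; text is measured as UTF-8."""
    if isinstance(payload, str):
        return len(payload.encode("utf-8"))
    return len(payload)


def payload_kind(payload):
    """Very rough format guess, for illustration only."""
    if isinstance(payload, bytes):
        return "binary"
    if payload.startswith("{"):
        return "json"
    return "text"


def describe(batch):
    """Summarise a batch by volume, velocity and variety."""
    # Volume: how much data is there?
    volume = sum(payload_size(p) for _, p in batch)
    # Velocity: how fast is it arriving? (assumed here: records per second)
    elapsed = batch[-1][0] - batch[0][0]
    velocity = len(batch) / elapsed if elapsed > 0 else None
    # Variety: how many different kinds of data are there?
    variety = dict(Counter(payload_kind(p) for _, p in batch))
    return {
        "volume_bytes": volume,
        "records_per_second": velocity,
        "variety": variety,
    }


summary = describe(records)
print(summary)
```

For this sample batch the summary reports 45 bytes of data, 2.0 records per second, and three formats (two JSON records, one text record and one binary record).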
