Skip to main content

Post #4: Reasons for the Growth of Big Data

Big Data is continuously growing, each and every organisation is dealing with more and more data with each passing day and this growth shows no signs of slowing down. There are various reasons for this swift increase in growth, I will now discuss a few of these reasons.

Business models are one of the main reasons for this exponential growth through the aggressive and continuous acquisition and permanent retention of data. Google is a perfect example of a business that is retaining vast amounts of data and this is definitely working in their favour as can be seen by their company growth. Infrastructure capacity is another reason for the increase as the cost of data storage has become incredibly low over the past few years while the capacity seems to be increasing almost doubling in the space of a couple of years.  Business analytics has also seen an increased acceleration in the past few years and is now over a 100 billion dollar market and continues to grow year to year.

Regulations have also come into place which dictate what type of data that organisations must capture and retain as well as any permissions they must request. This has lead to security organisations capturing and storing data such as audio and visual surveillance as well as system logs.

With the emergence of cloud infrastructure the cloud data storage prices have been lowering over the past couple of years while still providing increases in data size depending on the needs of the consumers.

These factors in the increase of growth within big data is not new information, but organisations are beginning to see it as an opportunity rather than the problem that it is and with this data growing only more businesses will develop tools to take advantage of it. 




Comments

Popular posts from this blog

FutureLearn Week 2: Post 3 of 4

Two of the biggest challenges of big data is Analysing and Visualising the data. Firstly with analysing the data, the size of big data files can sometimes be substantial, there are many things that must be considered before downloading the data, for example the file size, how long the data file will take to download, will all of it be necessary or will part of the file suffice and is there enough storage space within the system itself. Visualisation is way to represent the data in a way that is easier to understand such as word clouds and things of the like. This will aid users in seeing the prominent and key terms from the analysis of the data sets. The first step after downloading the data would be to quality check it to ensure that each field had the appropriate data types in each field and to ensure that the user understood the meaning of each field. Keeping a copy of the original data would be essential as well as each documented version change for each stage of visualisation....

Post #6: Traditional Statistics: Descriptive and Inferential in Big Data

Two types of Traditional statistics in Big data include Descriptive and Inferential. Descriptive means averages and working on sets of numbers. Descriptive statistics is a type of statistic in which a data set is summarised and the characteristics are described. This descriptive data is usually displayed through the use of tables, charts etc. but is most commonly reported as a measure of a central tendency. A central tendency is a typical value for a distribution, it is also been known to be called a location or centre of the distribution. The arithmetic mean is the most common measure of a central tendency, this is the median and the mode. The mean is the average of all the values, the median is the exact middle of the data set while the mode is the most frequent value in the data set.  The goal of traditional statistics is analysing and summarising data, providing tight assumptions about the problem and data distributions as well as using conservative techniques and approaches....

FutureLearn Week 2: Post 1 of 4

Open data has been increasing for some time now with data being made open on various sites globally. There are many advantages to having open data, these advantages include being able to share public data sets so that they can be compared. These open data sources can also be used for environmental purposes or even health issues. Disadvantages of open data would include the fact that the site providing the data would be inherently biased and formed in the opinion of the creator.