The term Bigdata demonstrates that this should be large
chunks of data. But the question you're probably left wondering is how large
data can possibly be? Well, let's consider this serious. example.
One fine morning commander-in-chief of an army organization
thinks his organization should monitor world happiness. So, he ordered his
fellows to track all the funny video searches of 'epic fails', 'bloopers', 'misses'
and 'lolls’, being watched on Google, YouTube, Facebook, and Twitter are to be
tracked, geo-located, and the trend to be analyzed on a real-time basis
throughout the world.
Now a simple google search will tell you that every minute
the number of videos uploaded to the world wide web is nearly 18TB of 720p mp4
codec. So even if this guy forgets all the pre-uploaded videos, he has to
analyze a significant portion of 18TBs of video files every minute. Now if you
add up the image files, real-time chats, and web searches, you'll end up
having approximately 24TBs of data generating every minute that too with an
increasing trend. This is Bigdata.
In other words, data that is not manageable by human
capabilities is big data. Big data is all around us wherever we go, wherever we
look there is big data in some form or the other.
Sensors are all around us, it is today present in all the
devices. These sensors themselves generated a huge amount of data and thus
sensor data is also one of the sources of big data.
Data generated every second in some form or the other, be it
from the transactions we make, from our images and videos, from our surveys,
from credit cards or debit cards or sensors, all are collectively called big
data.
Big data is generally said to have these major properties
like Volume, Variety, and Velocity.
Volume: is the humungous amount of data that is generated
from different sources like social media, cell phones, sensors, public releases
in the form of public data, photographs, videos. This data is so large that it
cannot be stored using traditional techniques to store and analyze data.
Variety: Variety comes into the picture when this huge
amount of data can be in different forms. These days, we no longer have just
the structured data that is in the form of relations but also unstructured and semi-structured data that cannot be stored in relational databases. Today,
80% of the total data is unstructured.
Velocity: Velocity is one of the key factors as no one likes
data coming at a lower speed. Therefore, speed plays a crucial role. Velocity
is the speed at which this data is collected, stored, and analyzed. Big data
technologies allow us to now analyze the data without even storing it into
traditional databases.
Our old-school database management software will simply
crash under this load. You'll need different sorts of database management
software for this kind of analysis.
إرسال تعليق