Face to Face with Apache Hadoop

Data – The Big Thing

We live in the age of data. It’s nearly impossible to say exactly how much data is stored electronically, but by one estimate the digital universe has grown to nearly 1.8 zettabytes. A zettabyte is 10^21 bytes, or one million petabytes, or one billion terabytes. Stored on DVDs, that much data would make a stack reaching from the Earth to the Moon and back. That is an enormous amount of data.

Where does this huge pile of data come from? Let’s look at some of the major sources:

  • More than 144.8 billion email messages are sent a day.
  • Facebook users add 300 million new photos each day, adding up to more than one petabyte of data daily.
  • Flickr photographers upload 3,125 new photos a minute.
  • WordPress bloggers publish close to 350 new blog posts a minute.
  • People upload 72…

