Showing posts from January, 2013

Brief Overview on BigData and Hadoop

Big Data: The term Big Data applies to information that can't be processed or analyzed using traditional processes and tools. It is essentially about processing terabytes of unstructured information to generate the insights a business requires.

Three basic characteristics of Big Data:

1. Volume: The data to be analyzed is measured in terabytes. For example, Twitter alone generates around 7 terabytes (TB) of data per day, and Facebook around 10 TB per day. Analyzing this volume requires a lot of hardware.

2. Variety: The data is not only organized in traditional table structures but also includes raw, semi-structured, and unstructured data such as weblogs, social media, blogs, sensor readings, images, and videos.

3. Velocity: Data of this volume and variety must be analyzed or processed quickly. Velocity matters because the data is still in motion, for example when analyzing real-time market data, or analyzing a customer's browsing pattern while they are logged in.

BigData case s