Survey on Big Data |
Author(s): |
Manju Lakshmi, Mount Zion College of Engineering; Smita C Thomas, Mount Zion College of Engineering |
Keywords: |
Big data, Hadoop |
Abstract |
Big data is the term for any collection of data sets so large and complex that it becomes difficult to process using traditional data processing applications. The challenges include analysis, capture, curation, search, sharing, storage, transfer, visualization, and privacy. The trend toward larger data sets is driven by the additional information that can be derived from analyzing a single large set of related data, as compared with separate smaller sets holding the same total amount of data, allowing correlations to be found to "spot business trends, prevent diseases, combat crime and so on." Big data is difficult to work with using most relational database management systems and desktop statistics and visualization packages, requiring instead "massively parallel software running on tens, hundreds, or even thousands of servers". Big data usually includes data sets whose size is beyond the ability of commonly used software tools to capture, curate, manage, and process within a tolerable elapsed time. Big data "size" is a constantly moving target, ranging from a few dozen terabytes to many petabytes of data. |
Other Details |
Paper ID: IJSRDV5I90064; Published in: Volume 5, Issue 9; Publication Date: 01/12/2017; Page(s): 134-138 |