High Impact Factor : 4.396 icon | Submit Manuscript Online icon |

Survey on Big Data

Author(s):

Manju Lakshmi , Mount zion college of engineering; Smita C Thomas, Mount zion college of engineering

Keywords:

Big data, Hadoop

Abstract

Big data is the term for any collection of data sets so large and complex that it becomes difficult to process using traditional data processing applications. The challenges include analysis, capture, curation, search, sharing, storage, transfer, visualization, and privacy violations. The trend to larger data sets is due to the additional information derivable from analysis of a single large set of related data, as compared to separate smaller sets with the same total amount of data, allowing correlations to be found to "spot business trends, prevent diseases, combat crime and so on." Big data is difficult to work with using most relational database management systems and desktop statistics and visualization packages, requiring instead "massively parallel software running on tens, hundreds, or even thousands of servers". Big data usually includes data sets with sizes beyond the ability of commonly used software tools to capture, curate, manage, and process data within a tolerable elapsed time. Big data "size" is a constantly moving target, as of its ranging from a few dozen terabytes to many petabytes of data.

Other Details

Paper ID: IJSRDV5I90064
Published in: Volume : 5, Issue : 9
Publication Date: 01/12/2017
Page(s): 134-138

Article Preview

Download Article