High Impact Factor : 4.396 icon | Submit Manuscript Online icon |

Future of Big Data beyond Batch Processing

Author(s):

Mansi Shah , N.S.I.T, Jetalpur; Vatika Tayal, N.S.I.T, Jetalpur

Keywords:

Big data, MapReduce, Hadoop, Real-time processing, Stream processing

Abstract

In recent years, big data are generated from a variety of sources, and there is an enormous demand for storing, managing, processing, and querying on big data. The MapReduce framework and its open source implementation Hadoop, has proven itself as the de facto solution for processing large amounts of data in parallel, and is intrinsically designed for batch processing and high throughput jobs. Although Hadoop has proven as de facto solution for batch jobs, there is growing demand for non-batch applications like: real-time queries, interactive jobs, and big data streams. Since Hadoop is not suitable for these non-batch jobs, new solutions are proposed to meet these new challenges. In this paper, we discuss the strength, features, and shortcomings of the standard MapReduce framework and its open source implementation Hadoop. In addition; we have discussed the significant extensions of MapReduce. Further, we have considered two categories of these solutions: real-time processing, and stream processing of big data. For each category, we have included paradigms and strengths.

Other Details

Paper ID: IJSRDV3I1124
Published in: Volume : 3, Issue : 1
Publication Date: 01/04/2015
Page(s): 217-220

Article Preview

Download Article