Future of Big Data beyond Batch Processing |
Author(s): |
| Mansi Shah , N.S.I.T, Jetalpur; Vatika Tayal, N.S.I.T, Jetalpur |
Keywords: |
| Big data, MapReduce, Hadoop, Real-time processing, Stream processing |
Abstract |
|
In recent years, big data are generated from a variety of sources, and there is an enormous demand for storing, managing, processing, and querying on big data. The MapReduce framework and its open source implementation Hadoop, has proven itself as the de facto solution for processing large amounts of data in parallel, and is intrinsically designed for batch processing and high throughput jobs. Although Hadoop has proven as de facto solution for batch jobs, there is growing demand for non-batch applications like: real-time queries, interactive jobs, and big data streams. Since Hadoop is not suitable for these non-batch jobs, new solutions are proposed to meet these new challenges. In this paper, we discuss the strength, features, and shortcomings of the standard MapReduce framework and its open source implementation Hadoop. In addition; we have discussed the significant extensions of MapReduce. Further, we have considered two categories of these solutions: real-time processing, and stream processing of big data. For each category, we have included paradigms and strengths. |
Other Details |
|
Paper ID: IJSRDV3I1124 Published in: Volume : 3, Issue : 1 Publication Date: 01/04/2015 Page(s): 217-220 |
Article Preview |
|
|
|
|
