Cloud Dataflow using Millwheel and Flume Java

Author(s):

Vani Ganji, V.E.S. Institute of Technology; Sachin Bhandari, V.E.S. Institute of Technology; Dhanamma Jagli, V.E.S. Institute of Technology

Keywords:

Pipeline; SDK; Prefix; Stream; Optimize; Graph; Analysis

Abstract

Cloud Dataflow is a stream-analysis tool for cloud storage and a system for building large, fast data-analysis pipelines. Dataflow is based on several technologies that Google has been using internally, including Flume and MillWheel. This article focuses on these technologies as well as the core components of Dataflow. As an example, a Twitter hashtag auto-completion tool is implemented with Dataflow and discussed.

Other Details

Paper ID: IJSRDV3I40528
Published in: Volume 3, Issue 4
Publication Date: 01/07/2015
Page(s): 837-841
