High Impact Factor : 4.396 icon | Submit Manuscript Online icon |

Cloud Dataflow using Millwheel and Flume Java


Vani Ganji , v.e.s.institute of technology; Sachin Bhandari, v.e.s.institute of technology; Dhanamma Jagli, v.e.s.institute of technology


Pipeline; Sdk; Prefix; Stream; Optimize; Graph; Analysis


Cloud DataFlow is a stream analysing tool for cloud storage. It is a system for building big and fast data analysis pipelines. Dataflow is based on few of technologies the company has beenusinginternally,including Flume and MillWheel. This article focuses on these technologies as well as the core components of DataFlow. An example of twitter hashtag auto-completion tool is considered and discussed by implementing it using this technology.

Other Details

Paper ID: IJSRDV3I40528
Published in: Volume : 3, Issue : 4
Publication Date: 01/07/2015
Page(s): 837-841

Article Preview

Download Article