Cloud Dataflow using Millwheel and Flume Java |
Author(s): |
| Vani Ganji , v.e.s.institute of technology; Sachin Bhandari, v.e.s.institute of technology; Dhanamma Jagli, v.e.s.institute of technology |
Keywords: |
| Pipeline; Sdk; Prefix; Stream; Optimize; Graph; Analysis |
Abstract |
|
Cloud DataFlow is a stream analysing tool for cloud storage. It is a system for building big and fast data analysis pipelines. Dataflow is based on few of technologies the company has beenusinginternally,including Flume and MillWheel. This article focuses on these technologies as well as the core components of DataFlow. An example of twitter hashtag auto-completion tool is considered and discussed by implementing it using this technology. |
Other Details |
|
Paper ID: IJSRDV3I40528 Published in: Volume : 3, Issue : 4 Publication Date: 01/07/2015 Page(s): 837-841 |
Article Preview |
|
|
|
|
