
Exploring Big Data Analysis Pipeline and Comparative Summarization of Mining Tools


Ms. Deepali Bajaj, Shaheed Rajguru College of Applied Sciences for Women, University of Delhi; Ms. Asha Yadav, Shaheed Rajguru College of Applied Sciences for Women, University of Delhi


Rapid-I Rapidminer, KDNuggets, Processing Pipeline Phases


Over the past few years, big data has become a buzzword in the IT industry. It is a major research concern for data-driven industries, where massive volumes of data must be processed and analysed to extract useful knowledge. It has great potential for revealing emerging trends, projecting future growth, and increasing productivity and competitiveness across entire sectors and economies. In this paper, we review the elements of big data and discuss the phases of the data analysis processing pipeline. The paper then describes the most popular specialized big data mining tools according to the survey that KDNuggets conducted in 2015 among the analytics and data mining community and vendors. It also contrasts two top-ranking tools, R and Rapid-I Rapidminer.

Other Details

Paper ID: NCILP032
Published in: Conference 1 : NCIL 2015
Publication Date: 16/10/2015
Page(s): 124-127
