
Exploring Big Data Analysis Pipeline and Comparative Summarization of Mining Tools

Author(s):

Ms. Deepali Bajaj, Shaheed Rajguru College of Applied Sciences for Women, University of Delhi; Ms. Asha Yadav, Shaheed Rajguru College of Applied Sciences for Women, University of Delhi

Keywords:

Rapid-I Rapidminer, KDNuggets, Processing Pipeline Phases

Abstract

For the past several years, big data has been a buzzword in the IT industry. It is a major research concern for data-driven industries, where massive volumes of data must be processed and analysed to extract useful knowledge. It has huge potential for revealing emerging trends, projecting upcoming growth, and increasing productivity and competitiveness across entire sectors and economies. In this paper, we review the elements of big data and discuss the phases of the data analysis processing pipeline. The paper then describes the most popular specialized big data mining tools, as ranked by the 2015 KDNuggets survey of the analytics and data mining community and vendors. It also contrasts two top-ranking tools, R and Rapid-I Rapidminer.

Other Details

Paper ID: NCILP032
Published in: Conference 1: NCIL 2015
Publication Date: 16/10/2015
Page(s): 124-127
