High Impact Factor : 4.396 icon | Submit Manuscript Online icon |

Co-Clustered: Data Co-Clustering using Distributed Framework for Sequential Restores

Author(s):

Prasitha Unnikrishnan , Sree Narayana Guru College; E. K. Girisan, Sree Narayana Guru College

Keywords:

Co-Clustering, Sequential Updates, Cloud Computing, Distributed Framework, FNMTF, ITCC

Abstract

Co-clustering is an effective data mining tool for existence and diploid data. As data sets become increasingly large, the scalability of existence becomes increasingly crucial. In this paper, we propose two approaches to parallelize co-clustering with subsequent amend in a distributed environment. Based on these two approaches, we present a new distributed framework, Co-ClusterD, that backing effective utilization of co-clustering algorithms with sequential updates. We developed and applied Co-ClusterD, and give its ability through two co-clustering algorithms: information theoretic co-clustering (ITCC), fast nonnegative matrix tri-factorization (FNMTF). We evaluate our framework on both a local cluster of the systems and the Amazon EC2 cloud. Our assessment gives that co-clustering algorithms implemented in Co-ClusterD can attain excellent outputs and run faster than their traditional concurrent counterparts.

Other Details

Paper ID: IJSRDV4I70259
Published in: Volume : 4, Issue : 7
Publication Date: 01/10/2016
Page(s): 359-364

Article Preview

Download Article