Co-Clustered: Data Co-Clustering using Distributed Framework for Sequential Restores |
Author(s): |
Prasitha Unnikrishnan , Sree Narayana Guru College; E. K. Girisan, Sree Narayana Guru College |
Keywords: |
Co-Clustering, Sequential Updates, Cloud Computing, Distributed Framework, FNMTF, ITCC |
Abstract |
Co-clustering is an effective data mining tool for existence and diploid data. As data sets become increasingly large, the scalability of existence becomes increasingly crucial. In this paper, we propose two approaches to parallelize co-clustering with subsequent amend in a distributed environment. Based on these two approaches, we present a new distributed framework, Co-ClusterD, that backing effective utilization of co-clustering algorithms with sequential updates. We developed and applied Co-ClusterD, and give its ability through two co-clustering algorithms: information theoretic co-clustering (ITCC), fast nonnegative matrix tri-factorization (FNMTF). We evaluate our framework on both a local cluster of the systems and the Amazon EC2 cloud. Our assessment gives that co-clustering algorithms implemented in Co-ClusterD can attain excellent outputs and run faster than their traditional concurrent counterparts. |
Other Details |
Paper ID: IJSRDV4I70259 Published in: Volume : 4, Issue : 7 Publication Date: 01/10/2016 Page(s): 359-364 |
Article Preview |
|
|