High Impact Factor : 4.396 icon | Submit Manuscript Online icon |

A Survey on Text Document Clustering Methodologies Based On Similarity Measure

Author(s):

R.Saranya , K.S.R COLLEGE OF ENGINEERING,TAMILNADU,INDIA; V.Sharmila, K.S.R COLLEGE OF ENGINEERING,TAMILNADU,INDIA; P.Balamurugan, K.S.R COLLEGE OF ENGINEERING,TAMILNADU,INDIA; R.Latha, K.S.R COLLEGE OF ENGINEERING,TAMILNADU,INDIA

Keywords:

Data mining, Text mining, Document clustering, Similarity measure

Abstract

Text data mining is research technologies to mine valuable knowledge from massive collections of documents and to improve a system to offer information and to support in decision making. Clustering is an automatic wisdom methodology aimed to combine a set of objects which are similar to each other. Text clustering has turned into a challenging process in recent centuries because of the massive quantity of unstructured data is presented in several formations. In Text document clustering grouping of text documents arises based upon their similarity. There are many rapid and high-excellence document clustering algorithms available which play a main role in efficiently establishing the information. In this paper we are going to discuss various methodology of clustering which is based on the document similarity.

Other Details

Paper ID: IJSRDV3I70291
Published in: Volume : 3, Issue : 7
Publication Date: 01/10/2015
Page(s): 771-773

Article Preview

Download Article