High Impact Factor : 4.396 icon | Submit Manuscript Online icon |

A Survey on Document Clustering

Author(s):

Latika , Kurukshetra University, Kurukshetra, India

Keywords:

Document Clustering, Evaluation Measures, Hierarchical Clustering, Partitioning Clustering, Similarity Measures, Vector Space Model

Abstract

Due to the advancement of internet, the volume of electronic documents available on the web is exploding. The need to organize similar documents together has attracted the attention of researchers in this area. Document clustering provides automatic organization of documents into groups so that documents in a group are similar to each other and documents in different groups are dissimilar. It plays important role in information retrieval, organizing search engine results, web mining etc. Several techniques have been provided for efficient document clustering. This paper presents a brief survey on various techniques used for document clustering including traditional, fuzzy based, GA, Hybrid etc. Also, document clustering procedure with document representation model, dimension reduction techniques, similarity measures and evaluation measures are explained.

Other Details

Paper ID: IJSRDV3I31230
Published in: Volume : 3, Issue : 3
Publication Date: 01/06/2015
Page(s): 2115-2120

Article Preview

Download Article