A Survey on Document Clustering |
Author(s): |
| Latika , Kurukshetra University, Kurukshetra, India |
Keywords: |
| Document Clustering, Evaluation Measures, Hierarchical Clustering, Partitioning Clustering, Similarity Measures, Vector Space Model |
Abstract |
|
Due to the advancement of internet, the volume of electronic documents available on the web is exploding. The need to organize similar documents together has attracted the attention of researchers in this area. Document clustering provides automatic organization of documents into groups so that documents in a group are similar to each other and documents in different groups are dissimilar. It plays important role in information retrieval, organizing search engine results, web mining etc. Several techniques have been provided for efficient document clustering. This paper presents a brief survey on various techniques used for document clustering including traditional, fuzzy based, GA, Hybrid etc. Also, document clustering procedure with document representation model, dimension reduction techniques, similarity measures and evaluation measures are explained. |
Other Details |
|
Paper ID: IJSRDV3I31230 Published in: Volume : 3, Issue : 3 Publication Date: 01/06/2015 Page(s): 2115-2120 |
Article Preview |
|
|
|
|
