Categorization of Text Data using Meta Data |
Author(s): |
| Sandeep Jadhav , MET's IOE Nashik; Dr. K. V. Metre, MET's IOE, Nashik |
Keywords: |
| Text Data Mining, Categorization, Side Information, Mete Data |
Abstract |
|
In today’s digital environment, text databases are rapidly increasing due to use of internet and communication mediums. Different text mining techniques are used for knowledge discovery and Information retrieval. Text data contains the side information or side attribute with the text data. Side attribute may be the metadata associated with text data like author, coauthor or citation network, document provenance information, web links or other kind of data which provide more insights about the text documents. Such side information contains tremendous amount of information for the clustering purpose. Using such side information in the categorization process provides more refine clustered data. But sometimes side information may be noisy and results in wrong categorization which decreases the quality of clustering process. Therefore, a new advance method for mining of text data using side information is suggested, which combines partitioning approach with probabilistic estimation model for the mining of text data along with the side information. |
Other Details |
|
Paper ID: IJSRDV4I50414 Published in: Volume : 4, Issue : 5 Publication Date: 01/08/2016 Page(s): 931-935 |
Article Preview |
|
|
|
|
