A Survey on Documentation Modelling in Information Filtering using Pattern Recognition |
Author(s): |
Ketaki Gadwale , Pune Institute of Computer Technology, Pune, India; Dr. Emmanuel M., Pune Institute of Computer Technology, Pune, India |
Keywords: |
Topic modeling, Pattern mining, User interest model, Document relevance, Information filtering, Information retrieval |
Abstract |
In the field of information filtering many full-fledged term-based or pattern-based approaches have been used by assuming the documents in a collection are all about one topic. However, user’s interest can be varied and generally multiple topics are involved in the collection of documents. Topic modeling, such as Latent Dirichlet allocation (LDA), has been widely exploited in the fields of machine learning and information retrieval etc. but its effectiveness in information filtering has not been so well explored. A novel information filtering model, Maximum matched Pattern-based Topic Model (MPBTM), is proposed. Following features makes the proposed model peculiar: (1) In terms of multiple topics the user information need are generated and each topic is represented by patterns; (2)the most discriminate and representative patterns, called Maximum Matched Patterns are generated from topic models and are systematized in terms of their statistical and taxonomic features and proposed to estimate the documents relevance in order to filter out irrelevant documents suited to user to user’s information needs. |
Other Details |
Paper ID: IJSRDV5I10295 Published in: Volume : 5, Issue : 1 Publication Date: 01/04/2017 Page(s): 386-389 |
Article Preview |
|
|