Fast and Efficient Classification of Text Documents from Repository |
Author(s): |
| Ajinkya Pawar , Matoshri College Of Engineering And Research Center, Nashik; Aparna Pulate, Matoshri College Of Engineering And Research Center, Nashik; Rushikesh Handge, Matoshri College Of Engineering And Research Center, Nashik; Vrushali Kolhe, Matoshri College Of Engineering And Research Center, Nashik |
Keywords: |
| Data Mining, Master Taxonomy, Target Taxonomy, Classification, Cat log Integration |
Abstract |
|
An axiological abstracts affiliation assignment faced by online bartering portals and business seek engines is the affiliation of products advancing from assorted providers to their artifact catalogs. In this scenario, the bartering aperture has its own anatomy (the “master taxonomyâ€), while anniversary abstracts provider organizes its articles into a altered anatomy (the “provider taxonomyâ€). In this paper, we accede the botheration of allocation articles from the abstracts providers into the adept taxonomy, while authoritative use of the provider taxonomy information. Our access is based on a taxonomy-aware processing footfall that adjusts the after-effects of a text-based classifier to ensure that articles that are abutting calm in the provider anatomy abide abutting in the adept taxonomy. We codify this intuition as a structured anticipation enhancement problem. To the best of our knowledge, this is the aboriginal access that leverages the structure of taxonomies in adjustment to enhance archive integration. We adduce algorithms that are scalable and appropriately applicative to the large abstracts sets that are archetypal on the web. We apprise our algorithms on real-world abstracts and we appearance that taxonomy-aware classification provides a cogent advance over absolute approaches. |
Other Details |
|
Paper ID: IJSRDV3I2111 Published in: Volume : 3, Issue : 2 Publication Date: 01/05/2015 Page(s): 574-577 |
Article Preview |
|
|
|
|
