High Impact Factor : 4.396 icon | Submit Manuscript Online icon |

Fast and Efficient Classification of Text Documents from Repository

Author(s):

Ajinkya Pawar , Matoshri College Of Engineering And Research Center, Nashik; Aparna Pulate, Matoshri College Of Engineering And Research Center, Nashik; Rushikesh Handge, Matoshri College Of Engineering And Research Center, Nashik; Vrushali Kolhe, Matoshri College Of Engineering And Research Center, Nashik

Keywords:

Data Mining, Master Taxonomy, Target Taxonomy, Classification, Cat log Integration

Abstract

An axiological abstracts affiliation assignment faced by online bartering portals and business seek engines is the affiliation of products advancing from assorted providers to their artifact catalogs. In this scenario, the bartering aperture has its own anatomy (the “master taxonomy”), while anniversary abstracts provider organizes its articles into a altered anatomy (the “provider taxonomy”). In this paper, we accede the botheration of allocation articles from the abstracts providers into the adept taxonomy, while authoritative use of the provider taxonomy information. Our access is based on a taxonomy-aware processing footfall that adjusts the after-effects of a text-based classifier to ensure that articles that are abutting calm in the provider anatomy abide abutting in the adept taxonomy. We codify this intuition as a structured anticipation enhancement problem. To the best of our knowledge, this is the aboriginal access that leverages the structure of taxonomies in adjustment to enhance archive integration. We adduce algorithms that are scalable and appropriately applicative to the large abstracts sets that are archetypal on the web. We apprise our algorithms on real-world abstracts and we appearance that taxonomy-aware classification provides a cogent advance over absolute approaches.

Other Details

Paper ID: IJSRDV3I2111
Published in: Volume : 3, Issue : 2
Publication Date: 01/05/2015
Page(s): 574-577

Article Preview

Download Article