High Impact Factor : 4.396 icon | Submit Manuscript Online icon |

Extraction of Top K Data from Web Pages


Darshana Dabhi , L. J. Institute of Engineering & Technology; Jasmin Jha, L. J. Institute of Engineering & Technology


Rank Search, Time Sensitive Queries, binning


The web contains data in huge amounts. This data is a large source of information. All this information is in the form of structured or unstructured data. List is a crucial source of structured data on the web. Ranking the list data is generously important for information retrieval. Tremendous efforts have been done for extracting information from the structured data, especially from web tables, which contain quality information. Instead of focusing on context- free structured data, we aim to focus on context that we can spot, and then using the context to render less controlled information and proceed to its extraction. Here we highlight expensive as well as, rich source of information on the web, those are top-k web pages. Top-k web pages contain rich and quality information. They aim to identity the top attribute values for the entities of interest. Extraction of such lists can help answering engines to generate different fact and can act as a pre-processing step.

Other Details

Paper ID: IJSRDV3I40684
Published in: Volume : 3, Issue : 4
Publication Date: 01/07/2015
Page(s): 1281-1284

Article Preview

Download Article