
Literature Survey on Extraction of Top-k Lists from Web Pages

Author(s):

Darshana R. Dabhi, L.J. Institute of Engineering & Technology; Ms. Jasmine Jha, L.J. Institute of Engineering & Technology

Keywords:

Rank Search, Time Sensitive Queries, binning

Abstract

The web contains a huge amount of data, which is a large source of information in both structured and unstructured form. Lists are a crucial source of structured data on the web, and ranking list data is highly important for information retrieval. Considerable effort has gone into extracting information from structured data, especially from web tables, which contain quality information. Instead of focusing on context-free structured data, we focus on context that can be identified, and then use that context to interpret less controlled information and proceed to its extraction. Here we highlight a valuable and rich source of information on the web: top-k web pages. Top-k web pages contain rich, high-quality information. They aim to identify the top attribute values for the entities of interest. Extracting such lists can help question-answering engines generate facts and can act as a pre-processing step.

Other Details

Paper ID: IJSRDV2I9399
Published in: Volume: 2, Issue: 9
Publication Date: 01/12/2014
Page(s): 737-739
