Title
Extracting Information from Semi-Structured Web Pages by Considering User's Context
Abstract
Many users now rely on web search engines to find and gather information, and they face a growing number of varied semi-structured information sources. Correlating, integrating, and presenting related information to users has therefore become important. When a user queries a search engine such as Yahoo or Google for specific information, the results include not only pages that provide the desired information but also other pages that merely mention it, so the number of returned pages is enormous. Consequently, the performance of web search engines, the overlap among their results for the same queries, and their limitations form a large and important area of research. Extracting information from web data sources has also become very important, because the amount of diverse semi-structured information available to users on the Internet is massive and growing, and the variety of web pages makes information extraction from the web a challenging problem. The problem is harder still because extracted information that is relevant to one user might not be relevant to others. An information extraction approach that considers the user's context, and more specifically the user's preferences, would therefore provide better results to that user. This paper proposes a framework for extracting information from semi-structured web pages by considering the user's context.
Year
2010
Venue
IKE
Keywords
web pages
Field
Static web page, Web search engine, World Wide Web, HITS algorithm, Web page, Computer science, Web modeling, Web navigation, Web service, Web server
DocType
Conference
Citations
0
PageRank
0.34
References
9
Authors
5
Name
Order
Citations
PageRank
Mahmoud Shaker111.36
Hamidah Ibrahim221546.72
Alwan A. Ali3108.72
Aida Mustapha49026.18
Lili Nurliyana Abdullah5265.97