Title
Extraction of web texts using content-density distribution
Abstract
We propose a method for identifying the content of a Web page and extracting the part of the page related to the query keywords, in order to generate more effective snippets for Web search engines. We regard the content as the set of words in the text of a Web page, and we generate a content-density distribution from both the position and the influence of each word. In our experiments, the proposed method made the content of Web pages easier to recognize than conventional snippet-based methods.
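The abstract does not give the formula for the content-density distribution, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that each occurrence of a query keyword spreads its weight over neighbouring token positions through a Hann window, and that the snippet is the fixed-length passage with the largest summed density. The names content_density and best_passage, the window and length parameters, and the per-keyword weights (standing in for a word's "influence") are all hypothetical.

```python
import math
import re


def content_density(text, keywords, window=5, weights=None):
    """Return (tokens, density); density[i] is the smoothed keyword mass at token i.

    Assumption: a Hann window of half-width `window` models how far a
    keyword's influence reaches. The paper may use a different kernel.
    """
    tokens = re.findall(r"\w+", text.lower())
    keys = {k.lower() for k in keywords}
    # Hypothetical per-keyword influence; every keyword defaults to 1.0.
    weights = {k.lower(): v for k, v in (weights or {}).items()}
    density = [0.0] * len(tokens)
    for pos, tok in enumerate(tokens):
        if tok not in keys:
            continue
        w = weights.get(tok, 1.0)
        lo = max(0, pos - window)
        hi = min(len(tokens) - 1, pos + window)
        for i in range(lo, hi + 1):
            # Hann kernel: 1 at the keyword, reaching 0 at distance window + 1.
            kernel = 0.5 * (1 + math.cos(math.pi * (i - pos) / (window + 1)))
            density[i] += w * kernel
    return tokens, density


def best_passage(text, keywords, length=8, **kwargs):
    """Return the `length`-token passage with the highest total density."""
    tokens, density = content_density(text, keywords, **kwargs)
    if not tokens:
        return ""
    length = min(length, len(tokens))
    starts = range(len(tokens) - length + 1)
    start = max(starts, key=lambda s: sum(density[s:s + length]))
    return " ".join(tokens[start:start + length])
```

For example, best_passage(page_text, ["snippet", "density"], length=20) would return the 20-token stretch of page_text where those keywords are most concentrated under the assumed kernel. A wider window favours passages where keywords are loosely clustered, while a narrow one favours exact co-occurrence.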
Year: 2011
DOI: 10.1007/978-3-642-25631-8_25
Venue: AIRS
Keywords: effective snippet, conventional method, web text, web search engine, content-density distribution, web page
Field: Same-origin policy, Static web page, Web search engine, Web search query, Site map, Semantic Web Stack, Information retrieval, Web page, Computer science, Backlink
DocType: Conference
Volume: 7097
ISSN: 0302-9743
Citations: 0
PageRank: 0.34
References: 9
Authors: 3
Name            Order  Citations  PageRank
Saori Kitahara  1      0          1.01
Koya Tamura     2      0          1.01
Kenji Hatano    3      30         10.41