Extraction of web texts using content-density distribution - Citegraph

Paper Info

Title
Extraction of web texts using content-density distribution

Abstract
We propose a method for grasping the content of each Web page and extracting a part of the Web page related to query keywords, in order to make more effective snippets of a Web search engine. We regard the content as a set of words in the text of a Web page, and we generate the content-density distribution by using both the position and the influence of the word. In our experiments, we found that the proposed method facilitated the recognition of the content of Web pages, as compared to conventional methods based on snippets.

Year	DOI	Venue
2011	10.1007/978-3-642-25631-8_25	AIRS
Keywords	Field	DocType
effective snippet,conventional method,web text,web search engine,content-density distribution,web page	Same-origin policy,Static web page,Web search engine,Web search query,Site map,Semantic Web Stack,Information retrieval,Web page,Computer science,Backlink	Conference
Volume	ISSN	Citations
7097	0302-9743	0
PageRank	References	Authors
0.34	9	3

Authors (3 rows)

Cited by (0 rows)

References (9 rows)

Name	Order	Citations	PageRank
Saori Kitahara	1	0	1.01
Koya Tamura	2	0	1.01
Kenji Hatano	3	30	10.41

1