Title
Using neighborhood information for automated categorization of Web pages
Abstract
In this paper we discuss several issues related to how expanding a Web document's representation affects the quality of topical categorization of Web pages. We consider expanding a Web page with the text content of its linking pages. We show that naive expansion can pull in too much noise and substantially harm categorization results. We present an approach to automated pruning of linking Web pages. We report that using our approach to form a Web page representation always leads to better results than traditional single-page categorization.
Year
2003
Venue
ISTA
Keywords
web pages
Field
Web development, Web search engine, Static web page, World Wide Web, Web page, Semantic Web Stack, Computer science, Web standards, Data Web, Web navigation
DocType
Conference
Citations
0
PageRank
0.34
References
10
Authors
1
Name
Nadejda Panteleeva
Order
1
Citations
0
PageRank
0.34