Title
A comparison of implicit and explicit links for web page classification
Abstract
It is well known that Web-page classification can be enhanced by using hyperlinks that provide linkages between Web pages. However, in the Web space, hyperlinks are usually sparse, noisy and thus in many situations can only provide limited help in classification. In this paper, we extend the concept of linkages from explicit hyperlinks to implicit links built between Web pages. By observing that people who search the Web with the same queries often click on different, but related documents together, we draw implicit links between Web pages that are clicked after the same queries. Those pages are implicitly linked. We provide an approach for automatically building the implicit links between Web pages using Web query logs, together with a thorough comparison between the uses of implicit and explicit links in Web page classification. Our experimental results on a large dataset confirm that the use of the implicit links is better than using explicit links in classification performance, with an increase of more than 10.5% in terms of the Macro-F1 measurement.
Year
DOI
Venue
2006
10.1145/1135777.1135871
WWW
Keywords
DocType
ISBN
web page classification,macro-f1 measurement,implicit link,explicit hyperlinks,web-page classification,web query log,explicit link,web space,web page,classification performance,web pages
Conference
1-59593-323-9
Citations 
PageRank 
References 
38
1.71
19
Authors
4
Name
Order
Citations
PageRank
Dou Shen1122459.46
Jian-Tao Sun2162974.03
Qiang Yang317039875.69
Zheng Chen45019256.89