Abstract | ||
---|---|---|
Abstract Link spamming,intends to mislead search engines and trigger an artificially high link-based ranking of specific target web pages. This paper introduces the concept of spam mass, a measure of the impact of link spamming,on a page’s ranking. We discuss how to estimate spam mass and how the estimates can help identifying pages that benefit significantly from link spamming. In our experiments on the host-level Yahoo! web graph we use spam mass estimates to successfully identify tens of thousands of instances of heavy-weight link spamming. |
Year | Venue | Keywords |
---|---|---|
2006 | VLDB '06 Proceedings of the 32nd international conference on Very large data bases | host-level yahoo,mass estimation,web graph,specific target web page,spam mass,high link-based ranking,link spamming,search engine,heavyweight link spamming,spam detection,spam mass estimate,web pages |
Field | DocType | Citations |
Data mining,Page hijacking,Web page,Information retrieval,Ranking,Computer science,Spambot,Forum spam,Spam blog,Link farm,Spamming | Conference | 47 |
PageRank | References | Authors |
2.74 | 13 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Z. Gyongi | 1 | 47 | 2.74 |
P. Berkhin | 2 | 47 | 2.74 |
Héctor García-Molina | 3 | 24359 | 5652.13 |
Jan O. Pedersen | 4 | 6301 | 1177.07 |