Title
Link spam detection based on mass estimation
Abstract
Link spamming intends to mislead search engines and trigger an artificially high link-based ranking of specific target web pages. This paper introduces the concept of spam mass, a measure of the impact of link spamming on a page’s ranking. We discuss how to estimate spam mass and how the estimates can help identify pages that benefit significantly from link spamming. In our experiments on the host-level Yahoo! web graph, we use spam mass estimates to successfully identify tens of thousands of instances of heavy-weight link spamming.
Year
2006
Venue
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Keywords
host-level yahoo, mass estimation, web graph, specific target web page, spam mass, high link-based ranking, link spamming, search engine, heavyweight link spamming, spam detection, spam mass estimate, web pages
Field
Data mining, Page hijacking, Web page, Information retrieval, Ranking, Computer science, Spambot, Forum spam, Spam blog, Link farm, Spamming
DocType
Conference
Citations
47
PageRank
2.74
References
13
Authors
4
Name                  Order  Citations  PageRank
Z. Gyöngyi            1      47         2.74
P. Berkhin            2      47         2.74
Héctor García-Molina  3      243595652.13
Jan O. Pedersen       4      63011177.07