Title
SURF: detecting and measuring search poisoning
Abstract
Search engine optimization (SEO) techniques are often abused to promote websites among search results. This is a practice known as blackhat SEO. In this paper we tackle a newly emerging and especially aggressive class of blackhat SEO, namely search poisoning. Unlike other blackhat SEO techniques, which typically attempt to promote a website's ranking only under a limited set of search keywords relevant to the website's content, search poisoning techniques disregard any term relevance constraint and are employed to poison popular search keywords with the sole purpose of diverting large numbers of users to short-lived traffic-hungry websites for malicious purposes. To accurately detect search poisoning cases, we designed a novel detection system called SURF. SURF runs as a browser component to extract a number of robust (i.e., difficult to evade) detection features from search-then-visit browsing sessions, and is able to accurately classify malicious search user redirections resulted from user clicking on poisoned search results. Our evaluation on real-world search poisoning instances shows that SURF can achieve a detection rate of 99.1% at a false positive rate of 0.9%. Furthermore, we applied SURF to analyze a large dataset of search-related browsing sessions collected over a period of seven months starting in September 2010. Through this long-term measurement study we were able to reveal new trends and interesting patterns related to a great variety of poisoning cases, thus contributing to a better understanding of the prevalence and gravity of the search poisoning problem.
Year
DOI
Venue
2011
10.1145/2046707.2046762
ACM Conference on Computer and Communications Security
Keywords
Field
DocType
poison popular search keyword,poisoning case,search poisoning case,search engine optimization,malicious search user,blackhat seo,real-world search poisoning instance,search poisoning technique,search poisoning problem,search result,search engine poisoning,search engine,measurement
False positive rate,World Wide Web,Search engine,Ranking,Computer security,Computer science,Search engine optimization,Search analytics,Spamdexing
Conference
Citations 
PageRank 
References 
31
1.28
13
Authors
3
Name
Order
Citations
PageRank
Long Lu169933.95
Roberto Perdisci2213797.99
Wenke Lee39351628.83