Title
Searching for spam: detecting fraudulent accounts via web search
Abstract
Twitter users are harassed increasingly often by unsolicited messages that waste time and mislead users into clicking nefarious links. While increasingly powerful methods have been designed to detect spam, many depend on complex methods that require training and analyzing message content. While many of these systems are fast, implementing them in real time could present numerous challenges. Previous work has shown that large portions of spam originate from fraudulent accounts. We therefore propose a system which uses web searches to determine if a given account is fraudulent. The system uses the web searches to measure the online presence of a user and labels accounts with insufficient web presence to likely be fraudulent. Using our system on a collection of actual Twitter messages, we are able to achieve a true positive rate over 74% and a false positive rate below 11%, a detection rate comparable to those achieved by more expensive methods. Given its ability to operate before an account has produced a single tweet, we propose that our system could be used most effectively by combining it with slower more expensive machine learning methods as a first line of defense, alerting the system of fraudulent accounts before they have an opportunity to inject any spam into the ecosystem.
Year
DOI
Venue
2013
10.1007/978-3-642-36516-4_21
PAM
Keywords
Field
DocType
expensive machine,online presence,detection rate,fraudulent account,insufficient web presence,true positive rate,false positive rate,web search,labels account,expensive method
Online presence management,False positive rate,World Wide Web,Web presence,Social graph,Computer science,Computer security,True positive rate
Conference
Citations 
PageRank 
References 
6
0.57
10
Authors
2
Name
Order
Citations
PageRank
Marcel Flores1616.74
Aleksandar Kuzmanovic296071.99