Title
You're Not Who You Claim to Be: Website Identity Check for Phishing Detection
Abstract
Phishing websites impersonate legitimate counterparts to lure users into visiting their websites. Once users visit a phishing website then the phishing website may steal users' private information or cause drive-by downloads. To detect a phishing website, human experts compare the claimed identity of a website with features in the website. For example, human experts often compare the domain name in the URL against the claimed identity. Most legitimate websites have domain names that match their identities, while phishing websites usually have less relevance between their domain names and their claimed (fake) identities. In addition to blacklists, whitelists, heuristics, and classifications used in the state-of-the-art systems, we propose to consider websites' identity claims. Our phishing detection system mimics this human expert behavior. Given a website, our system learns the identity that this website claims, and computes the textual relevance between this claimed identity and other features in the website. Our phishing detection system then uses this textual relevance as one of the features for classification, and our classifiers achieve more than 98% of true positive rate and very low false positive rate between 0.5% and 1%.
Year
DOI
Venue
2010
10.1109/ICCCN.2010.5560168
ICCCN
Keywords
Field
DocType
url,computer crime,private information,website identity check,human expert behavior,web sites,phishing detection,false positive rate,feature extraction,servers,logistics,training data
Training set,World Wide Web,Computer security,Computer science,Server,Feature extraction,Private information retrieval,Phishing detection
Conference
ISSN
ISBN
Citations 
1095-2055
978-1-4244-7114-0
2
PageRank 
References 
Authors
0.38
10
3
Name
Order
Citations
PageRank
Insoon Jo11048.15
Eunjin Jung212513.06
Heon Young Yeom321535.58