Meerkat: detecting website defacements through image-based object recognition - Citegraph

Paper Info

Title
Meerkat: detecting website defacements through image-based object recognition

Abstract
Website defacements and website vandalism can inflict significant harm on the website owner through the loss of sales, the loss of reputation, or because of legal ramifications. Prior work on website defacement detection focused on detecting unauthorized changes to the web server, e.g., via host-based intrusion detection systems or file-based integrity checks. However, most prior approaches lack the capabilities to detect the most prevalent defacement techniques used today: code and/or data injection attacks, and DNS hijacking. This is because these attacks do not actually modify the code or configuration of the website, but instead they introduce new content or redirect the user to a different website. In this paper, we approach the problem of defacement detection from a different angle: we use computer vision techniques to recognize if a website was defaced, similarly to how a human analyst decides if a website was defaced when viewing it in a web browser. We introduce MEERKAT, a defacement detection system that requires no prior knowledge about the website's content or its structure, but only its URL. Upon detection of a defacement, the system notifies the website operator, who can then take appropriate action. To detect defacements, MEERKAT automatically learns high-level features from screenshots of defaced websites by combining recent advances in machine learning, like stacked autoencoders and deep neural networks, with techniques from computer vision. These features are then used to create models that allow for the detection of newly defaced websites. We show the practicality of MEERKAT on the largest website defacement dataset to date, comprising 10,053,772 defacements observed between January 1998 and May 2014, and 2,554,905 legitimate websites. Overall, MEERKAT achieves true positive rates between 97.422% and 98.816%, false positive rates between 0.547% and 1.528%, and Bayesian detection rates between 98.583% and 99.845%, thus significantly outperforming existing approaches.
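
The Bayesian detection rate quoted in the abstract is the probability that a website is actually defaced given that the detector raised an alert. The sketch below shows how such a figure follows from a true positive rate, a false positive rate, and a base rate via Bayes' theorem. The function name `bayesian_detection_rate` is hypothetical, and the base rate used in the example is an assumption derived from the dataset sizes in the abstract; the abstract does not state which base rate the authors used, so the output should not be expected to reproduce the reported numbers.

```python
def bayesian_detection_rate(tpr: float, fpr: float, base_rate: float) -> float:
    """Return P(defaced | alert) using Bayes' theorem.

    tpr:       true positive rate,  P(alert | defaced)
    fpr:       false positive rate, P(alert | legitimate)
    base_rate: prior probability that a website is defaced, P(defaced)
    """
    for name, value in (("tpr", tpr), ("fpr", fpr), ("base_rate", base_rate)):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be in [0, 1], got {value}")

    true_alerts = tpr * base_rate            # P(alert and defaced)
    false_alerts = fpr * (1.0 - base_rate)   # P(alert and legitimate)
    total_alerts = true_alerts + false_alerts
    if total_alerts == 0.0:
        raise ValueError("the detector never alerts; the rate is undefined")
    return true_alerts / total_alerts


# Assumption: use the dataset composition from the abstract as the base rate.
DEFACED = 10_053_772
LEGITIMATE = 2_554_905
assumed_base_rate = DEFACED / (DEFACED + LEGITIMATE)

if __name__ == "__main__":
    # Best-case rates quoted in the abstract, combined with the assumed base rate.
    rate = bayesian_detection_rate(tpr=0.98816, fpr=0.00547,
                                   base_rate=assumed_base_rate)
    print(f"Bayesian detection rate (assumed base rate): {rate:.5f}")
```

The result depends heavily on the base rate: with the same true and false positive rates but a much lower share of defaced websites, most alerts would be false alarms, which is why this metric is usually reported alongside the raw rates.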

Year	Venue	Field
2015	Usenix Security Symposium	Internet privacy, World Wide Web, Injection attacks, Web browser, Computer security, Computer science, DNS hijacking, Website defacement, Image based, Intrusion detection system, Cognitive neuroscience of visual object recognition, Reputation
DocType	Citations	PageRank
Conference	7	0.60
References	Authors
33	3

Authors (3 rows)

Name	Order	Citations	PageRank
Kevin Borgolte	1	67	8.48
Christopher Kruegel	2	8799	516.05
Giovanni Vigna	3	7121	507.72
