Title
Measurement and classification of humans and bots in internet chat
Abstract
The abuse of chat services by automated programs, known as chat bots, poses a serious threat to Internet users. Chat bots target popular chat networks to distribute spam and malware. In this paper, we first conduct a series of measurements on a large commercial chat network. Our measurements capture a total of 14 different types of chat bots, ranging from simple to advanced. Moreover, we observe that human behavior is more complex than bot behavior. Based on the measurement study, we propose a classification system to accurately distinguish chat bots from human users. The proposed classification system consists of two components: (1) an entropy-based classifier and (2) a machine-learning-based classifier. The two classifiers complement each other in chat bot detection. The entropy-based classifier is more accurate at detecting unknown chat bots, whereas the machine-learning-based classifier is faster at detecting known chat bots. Our experimental evaluation shows that the proposed classification system is highly effective in differentiating bots from humans.
Year
2008
Venue
USENIX Security Symposium
Keywords
machine-learning-based classifier, unknown chat bots, popular chat network, differentiating bots, chat bot detection, chat bots, chat service, entropy-based classifier, proposed classification system, internet chat, large commercial chat network, classification system, human behavior, machine learning
Field
World Wide Web, Computer science, Classifier (linguistics), Malware, The Internet
DocType
Conference
Citations
16
PageRank
1.74
References
16
Authors
4
Name | Order | Citations | PageRank
Steven Gianvecchio | 1 | 583 | 26.81
Mengjun Xie | 2 | 212 | 23.46
Zhenyu Wu | 3 | 661 | 30.31
Haining Wang | 4 | 2574 | 160.07