Title
Spam email filtering using network-level properties
Abstract
Spam is serious problem that affects email users (e.g. phishing attacks, viruses and time spent reading unwanted messages). We propose a novel spam email filtering approach based on network-level attributes (e.g. the IP sender geographic coordinates) that are more persistent in time when compared to message content. This approach was tested using two classifiers, Naive Bayes (NB) and Support Vector Machines (SVM), and compared against bag-of-words models and eight blacklists. Several experiments were held with recent collected legitimate (ham) and non legitimate (spam) messages, in order to simulate distinct user profiles from two countries (USA and Portugal). Overall, the network-level based SVM model achieved the best discriminatory performance. Moreover, preliminary results suggests that such method is more robust to phishing attacks.
Year
Venue
Keywords
2010
ICDM
support vector machines,email user,spam email,distinct user profile,network-level property,phishing attack,network-level attribute,svm model,discriminatory performance,bag-of-words model,novel spam email,naive bayes,text mining,bag of words,support vector machine
Field
DocType
Volume
Bag-of-words model,Network level,Data mining,Email filtering,Phishing,Naive Bayes classifier,Computer science,Support vector machine,Communication source,Spam and Open Relay Blocking System,Artificial intelligence,Machine learning
Conference
6171
ISSN
ISBN
Citations 
0302-9743
3-642-14399-7
3
PageRank 
References 
Authors
0.43
14
5
Name
Order
Citations
PageRank
Paulo Cortez115712.29
antonio gomes correia230.43
Pedro Sousa317425.25
Miguel Rocha451154.06
Miguel Rio527729.40