Title
Email Spam Detection: A Symbiotic Feature Selection Approach Fostered By Evolutionary Computation
Abstract
The electronic mail (email) is nowadays an essential communication service being widely used by most Internet users. One of the main problems affecting this service is the proliferation of unsolicited messages (usually denoted by spam) which, despite the efforts made by the research community, still remains as an inherent problem affecting this Internet service. In this perspective, this work proposes and explores the concept of a novel symbiotic feature selection approach allowing the exchange of relevant features among distinct collaborating users, in order to improve the behavior of anti-spam filters. For such purpose, several Evolutionary Algorithms (EA) are explored as optimization engines able to enhance feature selection strategies within the anti-spam area. The proposed mechanisms are tested using a realistic incremental retraining evaluation procedure and resorting to a novel corpus based on the well-known Enron datasets mixed with recent spam data. The obtained results show that the proposed symbiotic approach is competitive also having the advantage of preserving end-users privacy.
Year
DOI
Venue
2013
10.1142/S0219622013500326
INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING
Keywords
Field
DocType
Spam detection, content-based filtering, evolutionary algorithms, Naive Bayes, feature selection
Information system,Data mining,Feature selection,Naive Bayes classifier,Evolutionary algorithm,Computer science,Evolutionary computation,Artificial intelligence,Retraining,Machine learning,Email spam,The Internet
Journal
Volume
Issue
ISSN
12
4
0219-6220
Citations 
PageRank 
References 
3
0.41
20
Authors
5
Name
Order
Citations
PageRank
Pedro Sousa117425.25
Paulo Cortez236021.71
Rui Vaz351.18
Miguel Rocha451154.06
Miguel Rio527729.40