Title
Spam Filtering: an Active Learning Approach using Incremental Clustering
Abstract
This paper introduces a method that deals with unwanted mail messages by combining active learning with incremental clustering. The proposed approach is motivated by the fact that the user cannot provide the correct category for all received messages. The email messages are divided into chronological batches (e.g. one per day). The user is asked to give the correct categories (labels) for the messages of the first batch and from then on the proposed algorithm decides when to ask for a new label, based on a clustering of the messages that is incrementally updated. We test different variants of the algorithm on a number of different datasets and show that it achieves very good results with only 2% of all email messages labelled by the user.
Year
DOI
Venue
2014
10.1145/2611040.2611059
WIMS
Keywords
Field
DocType
incremental clustering,algorithms,experimentation,semi-supervised learning,spam filtering,active learning,information storage and retrieval,machine learning,semi supervised learning
Ask price,Active learning,Semi-supervised learning,Computer science,Filter (signal processing),Artificial intelligence,Cluster analysis,Machine learning
Conference
Citations 
PageRank 
References 
5
0.40
19
Authors
3
Name
Order
Citations
PageRank
Kleanthi Georgala162.12
Aris Kosmopoulos2543.57
G. Paliouras336626.38