Abstract | ||
---|---|---|
This paper introduces a method that deals with unwanted mail messages by combining active learning with incremental clustering. The proposed approach is motivated by the fact that the user cannot provide the correct category for all received messages. The email messages are divided into chronological batches (e.g. one per day). The user is asked to give the correct categories (labels) for the messages of the first batch and from then on the proposed algorithm decides when to ask for a new label, based on a clustering of the messages that is incrementally updated. We test different variants of the algorithm on a number of different datasets and show that it achieves very good results with only 2% of all email messages labelled by the user. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1145/2611040.2611059 | WIMS |
Keywords | Field | DocType |
incremental clustering,algorithms,experimentation,semi-supervised learning,spam filtering,active learning,information storage and retrieval,machine learning,semi supervised learning | Ask price,Active learning,Semi-supervised learning,Computer science,Filter (signal processing),Artificial intelligence,Cluster analysis,Machine learning | Conference |
Citations | PageRank | References |
5 | 0.40 | 19 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Kleanthi Georgala | 1 | 6 | 2.12 |
Aris Kosmopoulos | 2 | 54 | 3.57 |
G. Paliouras | 3 | 366 | 26.38 |