Title
SPAM DETECTION USING DATA COMPRESSION AND SIGNATURES
Abstract
In this article, we introduce a novel method for spam detection based on a combination of Bayesian filtering, signature trees, and data compression–based similarity. Bayesian filtering is one of the most popular and most efficient algorithms for dealing with spam detection. The problem with Bayesian filtering is that it is unable to classify any e-mail without doubt and sometimes spam e-mails are classified as regular e-mails. This novel method sorts out this problem by using signature trees and data compression–based similarity. The main result of this article is an up to 99% improvement in spam detection precision using this novel method.
Year
DOI
Venue
2013
10.1080/01969722.2013.805110
Cybernetics and Systems
Keywords
Field
DocType
main result,spam detection using data,spam detection precision,novel method,data compression,spam e-mail,regular e-mail,novel method sort,efficient algorithm,spam detection,signature tree,spam
Bag-of-words model,Data mining,Computer science,Artificial intelligence,Data compression,Bayesian filtering,Machine learning
Journal
Volume
Issue
ISSN
44
6-7
0196-9722
Citations 
PageRank 
References 
2
0.39
18
Authors
4
Name
Order
Citations
PageRank
Michal Prilepok1326.45
Petr Berek220.39
Jan Platos328658.72
Václav Snasel41261210.53