Title
A Multimodal Data Mining Framework for Revealing Common Sources of Spam Images.
Abstract
This paper proposes a multimodal framework that clusters spam images so that ones from the same spam source/cluster are grouped together. By identifying the common sources of spam images, we can provide evidence in tracking spam gangs. For this purpose, text recognition and visual feature extraction are performed. Subsequently, a two-level clustering method is applied where images with visually similar illustrations are first grouped together. Then the clustering result from the first level is further refined using the textual clues (if applicable) contained in spam images. Our experimental results show the effectiveness of the proposed framework.
Year
DOI
Venue
2009
10.4304/jmm.4.4.313-320
Journal of Multimedia
Keywords
DocType
Volume
index terms—spam image,clustering,multimodal analysis,botnet,computer forensics
Journal
4
Issue
Citations 
PageRank 
5
9
0.62
References 
Authors
12
6
Name
Order
Citations
PageRank
Chengcui Zhang178984.56
Wei-Bang Chen29718.16
Xin Chen3989.56
Richa Tiwari4112.67
Lin Yang51291116.88
Gary Warner611912.43