DocumentCode
3197957
Title
Identifying image spam authorship with variable bin-width histogram-based projective clustering
Author
Gao, Song ; Zhang, Chengcui ; Chen, Wei-Bang
Author_Institution
Univ. of Alabama at Birmingham, Birmingham, AL, USA
fYear
2011
fDate
11-15 July 2011
Firstpage
1
Lastpage
6
Abstract
In this paper we present a two-phase spam image clustering framework. The proposed framework performs a histogram based projective clustering on visual features in the first phase, followed by a text-based clustering in the second phase. There are several contributions in this study. First, we address the complex nature of spam image obfuscation techniques. Second, a multi-clue framework is developed to profile spam images of common spamming sources which provide evidence for tracking spam gangs. Third, projective clustering eliminates the need to choose among distance metrics for clustering analysis, while systematically exploring subspaces that correspond to clusters.
Keywords
image processing; pattern clustering; unsolicited e-mail; common spamming sources; distance metrics; image spam authorship identification; multiclue framework; spam gangs tracking; spam image obfuscation techniques; text-based clustering; two-phase spam image clustering framework; variable bin-width histogram-based projective clustering; visual features; Clustering algorithms; Entropy; Feature extraction; Guidelines; Histograms; Image color analysis; Image edge detection; Spam image clustering; histogram-based projective clustering; wavy spam image correction;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo (ICME), 2011 IEEE International Conference on
Conference_Location
Barcelona
ISSN
1945-7871
Print_ISBN
978-1-61284-348-3
Electronic_ISBN
1945-7871
Type
conf
DOI
10.1109/ICME.2011.6012082
Filename
6012082
Link To Document