Title :
Identifying image spam authorship with variable bin-width histogram-based projective clustering
Author :
Gao, Song ; Zhang, Chengcui ; Chen, Wei-Bang
Author_Institution :
Univ. of Alabama at Birmingham, Birmingham, AL, USA
Abstract :
In this paper we present a two-phase spam image clustering framework. The proposed framework performs a histogram based projective clustering on visual features in the first phase, followed by a text-based clustering in the second phase. There are several contributions in this study. First, we address the complex nature of spam image obfuscation techniques. Second, a multi-clue framework is developed to profile spam images of common spamming sources which provide evidence for tracking spam gangs. Third, projective clustering eliminates the need to choose among distance metrics for clustering analysis, while systematically exploring subspaces that correspond to clusters.
Keywords :
image processing; pattern clustering; unsolicited e-mail; common spamming sources; distance metrics; image spam authorship identification; multiclue framework; spam gangs tracking; spam image obfuscation techniques; text-based clustering; two-phase spam image clustering framework; variable bin-width histogram-based projective clustering; visual features; Clustering algorithms; Entropy; Feature extraction; Guidelines; Histograms; Image color analysis; Image edge detection; Spam image clustering; histogram-based projective clustering; wavy spam image correction;
Conference_Titel :
Multimedia and Expo (ICME), 2011 IEEE International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-61284-348-3
Electronic_ISBN :
1945-7871
DOI :
10.1109/ICME.2011.6012082