• DocumentCode
    3197957
  • Title

    Identifying image spam authorship with variable bin-width histogram-based projective clustering

  • Author

    Gao, Song ; Zhang, Chengcui ; Chen, Wei-Bang

  • Author_Institution
    Univ. of Alabama at Birmingham, Birmingham, AL, USA
  • fYear
    2011
  • fDate
    11-15 July 2011
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    In this paper we present a two-phase spam image clustering framework. The proposed framework performs a histogram based projective clustering on visual features in the first phase, followed by a text-based clustering in the second phase. There are several contributions in this study. First, we address the complex nature of spam image obfuscation techniques. Second, a multi-clue framework is developed to profile spam images of common spamming sources which provide evidence for tracking spam gangs. Third, projective clustering eliminates the need to choose among distance metrics for clustering analysis, while systematically exploring subspaces that correspond to clusters.
  • Keywords
    image processing; pattern clustering; unsolicited e-mail; common spamming sources; distance metrics; image spam authorship identification; multiclue framework; spam gangs tracking; spam image obfuscation techniques; text-based clustering; two-phase spam image clustering framework; variable bin-width histogram-based projective clustering; visual features; Clustering algorithms; Entropy; Feature extraction; Guidelines; Histograms; Image color analysis; Image edge detection; Spam image clustering; histogram-based projective clustering; wavy spam image correction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo (ICME), 2011 IEEE International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1945-7871
  • Print_ISBN
    978-1-61284-348-3
  • Electronic_ISBN
    1945-7871
  • Type

    conf

  • DOI
    10.1109/ICME.2011.6012082
  • Filename
    6012082