مرکز منطقه ای اطلاع رساني علوم و فناوري - Language-model-based detection cascade for efficient classification of image-based spam e-mail

DocumentCode :

2934319

Title :

Language-model-based detection cascade for efficient classification of image-based spam e-mail

Author :

Hsia, Jen-Hao ; Chen, Ming-Syan

Author_Institution :

Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan

fYear :

2009

fDate :

June 28 2009-July 3 2009

Firstpage :

1182

Lastpage :

1185

Abstract :

A new challenge in the spam email detection is the emergence of image spam, which consists in embedding the advertising messages into attached images to defeat the conventional text-based anti-spam technologies. New techniques are needed to filter these spam messages. In this paper, we proposed a prototype system to automatically classify an image directly as being spam or ham. The proposed method extracts latent topics in image to train a binary classifier for detecting spam images, and achieves more promising detection accuracy than conventional antispam approaches. In addition, a detection cascade is proposed to further reduce the computation overhead of the spam filter. Our algorithm is experimentally evaluated under a public spam image dataset, and shown to significantly improve both the detection accuracy and execution efficiency over the baseline approach.

Keywords :

image classification; information filtering; object detection; security of data; unsolicited e-mail; binary classifier; feature extraction; image classification; language-model-based detection; public spam image dataset; spam email detection; spam filter; text-based antispam technologies; Advertising; Electronic mail; Histograms; Image retrieval; Information filtering; Information filters; Optical character recognition software; Optical filters; Prototypes; Unsolicited electronic mail; Image spam; Near-duplicate image detection; Visual bag-of-words; pLSA;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on

Conference_Location :

New York, NY

ISSN :

1945-7871

Print_ISBN :

978-1-4244-4290-4

Electronic_ISBN :

1945-7871

Type :

conf

DOI :

10.1109/ICME.2009.5202711

Filename :

5202711

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2934319