DocumentCode
3579017
Title
A modular approach towards image spam filtering using multiple classifiers
Author
Das, Meghali ; Bhomick, Alexy ; Singh, Y.Jayanta ; Prasad, Vijay
Author_Institution
Dept. of Computer Science & Engineering and IT, Don Bosco College of Engineering and Technology, Guwahati, India
fYear
2014
Firstpage
1
Lastpage
8
Abstract
Image based spam is a recent trick developed by the spammers´ community with the intention of bypassing the successful text based spam filters. Most of the traditional text based filters have been based on Naïve Bayes classification combined with text categorization methods. This work concentrates in developing a spam filtering system that accurately blocks image spam. The system analyzes images sent as attachments extracting both textual and visual features. The rationale behind employing a combination of both kinds of features is that spammers usually embed the payload in an image hidden by various obscuring methods. We used SVM classifier for the classification of low level features. The use of a noncommercial OCR for extracting text from images also delivered better accuracy. The Voting scheme provides a final measure of the spamminess of the images with its decision based on the maximum probability assigned by the two classifiers.
Keywords
Feature extraction; Filtering; Image color analysis; Optical character recognition software; Text categorization; Unsolicited electronic mail; Content-based spam filtering; Image Spam; Low-level features; Spam Filtering; Text-Categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Computing Research (ICCIC), 2014 IEEE International Conference on
Print_ISBN
978-1-4799-3974-9
Type
conf
DOI
10.1109/ICCIC.2014.7238323
Filename
7238323
Link To Document