DocumentCode :
146555
Title :
Towards filtering spam mails using dimensionality reduction methods
Author :
Thomas, Julian ; Raj, Nisha S. ; Vinod, P.
Author_Institution :
Dept. of Comput. Sci. & Eng., SCMS Sch. of Eng. & Technol., Ernakulam, India
fYear :
2014
fDate :
25-26 Sept. 2014
Firstpage :
163
Lastpage :
168
Abstract :
Numerous methods based on the content based filtering is available for email spam identification. Dimensionality of the feature space is recognized as one of the leading factors that affect the efficiency in classifying mails. This study identifies feature selection techniques used in the general text classification for spam filtering. Also, the classification and prediction is performed using different entities of email such as header, body and subject. We present a comparative study of different feature selection methods. Through extensive experiments we demonstrated that Weighted Mutual Information feature selection with header and body of the emails is efficient in email classification.
Keywords :
e-mail filters; feature selection; information filtering; pattern classification; text analysis; unsolicited e-mail; content based filtering; dimensionality reduction methods; email classification; email spam identification; feature selection method; feature selection techniques; feature space; filtering spam mails; leading factors; spam filtering; text classification; weighted mutual Information feature selection; Accuracy; Mutual information; Postal services; Support vector machines; Unsolicited electronic mail; Vectors; classifier; dimensionality reduction; feature selection; spam filtering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Confluence The Next Generation Information Technology Summit (Confluence), 2014 5th International Conference -
Conference_Location :
Noida
Print_ISBN :
978-1-4799-4237-4
Type :
conf
DOI :
10.1109/CONFLUENCE.2014.6949337
Filename :
6949337
Link To Document :
بازگشت