Title :
Feature selection and similarity coefficient based method for email spam filtering
Author :
Abdelrahim, Ali Ahmed A. ; Elhadi, Ammar Ahmed E. ; Ibrahim, Haidi ; Elmisbah, Naser
Author_Institution :
Fac. of Eng., Karary Univ., Omdurman, Sudan
Abstract :
Many threats in the real world can be related to activities of persons on the Internet. Spam is one of the most pressing security problems online. Spam filters try to identify likely spam either manually or automatically. Most of the spam datasets used in the spam filtering area of study deal with large amounts of data containing irrelevant and/or redundant features. This redundant information has a negative impact on the accuracy and detection rate of many methods that have been used for detection and filtering. In this study, statistical feature selection approach combined with similarity coefficients are used to improve the accuracy and detection rate for the spam detection and filtering. At the end, the study results based on email spam datasets show that our proposed approach enhanced the detection rate, false alarm rate and the accuracy.
Keywords :
Internet; computer network security; e-mail filters; feature extraction; information filtering; statistical analysis; unsolicited e-mail; Internet; accuracy rate improvement; detection rate improvement; email spam datasets; email spam filtering; false alarm rate; irrelevant data features; online security problems; redundant data features; redundant information; similarity coefficient-based method; spam detection; spam identification; statistical feature selection; Accuracy; Educational institutions; Feature extraction; Filtering; Optimization; Unsolicited electronic mail; Spam; feature selection; similarity coefficient; spam filtering;
Conference_Titel :
Computing, Electrical and Electronics Engineering (ICCEEE), 2013 International Conference on
Conference_Location :
Khartoum
Print_ISBN :
978-1-4673-6231-3
DOI :
10.1109/ICCEEE.2013.6634013