Title :
Spam filtering with abductive networks
Author :
El-Alfy, El-Sayed M. ; Abdel-Aal, Radwan E.
Author_Institution :
Coll. of Comput. Sci. & Eng., King Fahd Univ. of Pet. & Miner., Dhahran
Abstract :
Spam messages pose a major threat to the usability of electronic mail. Spam wastes time and money for network users and administrators, consumes network bandwidth and storage space, and slows down email servers. In addition, it provides a medium to distribute harmful code and/or offensive content. In this paper, we investigate the application of abductive learning in filtering out spam messages. We study the performance for various network models on the spambase dataset. Results reveal that classification accuracies of 91.7% can be achieved using only 10 out of the available 57 content attributes. The attributes are selected automatically by the abductive learning algorithm as the most effective feature subset, thus achieving approximately 6:1 data reduction. Comparison with other techniques such as multi-layer perceptrons and naive Bayesian classifiers show that the abductive learning approach can provide better spam detection accuracies, e.g. false positive rates as low as 5.9% while requiring much shorter training times.
Keywords :
data reduction; information filtering; learning (artificial intelligence); unsolicited e-mail; abductive learning algorithm; abductive network; attribute selection; data reduction; electronic mail; feature subset; harmful code; spam message filtering; spambase dataset; Filtering; Neural networks; Unsolicited electronic mail;
Conference_Titel :
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-1820-6
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2008.4633784