Title :
Discovering classification rules for email spam filtering with an ant colony optimization algorithm
Author :
El-Alfy, El-Sayed M.
Author_Institution :
Coll. of Comput. Sci. & Eng., King Fahd Univ. of Pet. & Miner., Dhahran
Abstract :
The cost estimates for receiving unsolicited commercial email messages, also known as spam, are threatening. Spam has serious negative impact on the usability of electronic mail and network resources. In addition, it provides a medium for distributing harmful code and/or offensive content. The work in this paper is motivated by the dramatic increase in the volume of spam traffic in recent years and the promising ability of ant colony optimization in data mining. Our goal is to develop an ant-colony based spam filter and to empirically evaluate its effectiveness in predicting spam messages. We also compare its performance to three other popular machine learning techniques: multi-layer perceptron, naive Bayes and Ripper classifiers. The preliminary results show that the developed model can be a remarkable alternative tool in filtering spam; yielding better accuracy with considerably smaller rule sets which highlight the important features in identifying the email category.
Keywords :
Bayes methods; data mining; information filtering; learning (artificial intelligence); multilayer perceptrons; optimisation; pattern classification; unsolicited e-mail; Ripper classifier; ant colony optimization algorithm; classification rule discovery; cost estimate; data mining; email spam filtering; harmful code distribution; machine learning technique; multilayer perceptron; naive Bayes classifier; unsolicited commercial electronic mail message; Ant colony optimization; Costs; Data mining; Electronic mail; Filtering algorithms; Filters; Machine learning; Telecommunication traffic; Unsolicited electronic mail; Usability;
Conference_Titel :
Evolutionary Computation, 2009. CEC '09. IEEE Congress on
Conference_Location :
Trondheim
Print_ISBN :
978-1-4244-2958-5
Electronic_ISBN :
978-1-4244-2959-2
DOI :
10.1109/CEC.2009.4983156