Title :
Ham or spam? A comparative study for some content-based classification algorithms for email filtering
Author :
Saab, Salwa Adriana ; Mitri, Nicholas ; Awad, Maher
Author_Institution :
Fac. of Electr. & Comput. Eng., American Univ. of Beirut, Beirut, Lebanon
Abstract :
Spam emails are widely spreading to constitute a significant share of everyone´s daily inbox. Being a source of financial loss and inconvenience for the recipients, spam emails have to be filtered and separated from legitimate ones. This paper presents a survey of some popular filtering algorithms that rely on text classification to decide whether an email is unsolicited or not. A comparison among them is performed on the SpamBase dataset to identify the best classification algorithm in terms of accuracy, computational time, and precision/recall rates.
Keywords :
classification; content-based retrieval; information filtering; text analysis; unsolicited e-mail; SpamBase dataset; computational time; content-based classification algorithm; email filtering; filtering algorithms; financial loss; ham; spam emails; text classification; Accuracy; Artificial neural networks; Classification algorithms; Electronic mail; Support vector machines; Testing; Training;
Conference_Titel :
Mediterranean Electrotechnical Conference (MELECON), 2014 17th IEEE
Conference_Location :
Beirut
DOI :
10.1109/MELCON.2014.6820574