DocumentCode :
143610
Title :
Ham or spam? A comparative study for some content-based classification algorithms for email filtering
Author :
Saab, Salwa Adriana ; Mitri, Nicholas ; Awad, Maher
Author_Institution :
Fac. of Electr. & Comput. Eng., American Univ. of Beirut, Beirut, Lebanon
fYear :
2014
fDate :
13-16 April 2014
Firstpage :
339
Lastpage :
343
Abstract :
Spam emails are widely spreading to constitute a significant share of everyone´s daily inbox. Being a source of financial loss and inconvenience for the recipients, spam emails have to be filtered and separated from legitimate ones. This paper presents a survey of some popular filtering algorithms that rely on text classification to decide whether an email is unsolicited or not. A comparison among them is performed on the SpamBase dataset to identify the best classification algorithm in terms of accuracy, computational time, and precision/recall rates.
Keywords :
classification; content-based retrieval; information filtering; text analysis; unsolicited e-mail; SpamBase dataset; computational time; content-based classification algorithm; email filtering; filtering algorithms; financial loss; ham; spam emails; text classification; Accuracy; Artificial neural networks; Classification algorithms; Electronic mail; Support vector machines; Testing; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Mediterranean Electrotechnical Conference (MELECON), 2014 17th IEEE
Conference_Location :
Beirut
Type :
conf
DOI :
10.1109/MELCON.2014.6820574
Filename :
6820574
Link To Document :
بازگشت