DocumentCode
143610
Title
Ham or spam? A comparative study for some content-based classification algorithms for email filtering
Author
Saab, Salwa Adriana ; Mitri, Nicholas ; Awad, Maher
Author_Institution
Fac. of Electr. & Comput. Eng., American Univ. of Beirut, Beirut, Lebanon
fYear
2014
fDate
13-16 April 2014
Firstpage
339
Lastpage
343
Abstract
Spam emails are widely spreading to constitute a significant share of everyone´s daily inbox. Being a source of financial loss and inconvenience for the recipients, spam emails have to be filtered and separated from legitimate ones. This paper presents a survey of some popular filtering algorithms that rely on text classification to decide whether an email is unsolicited or not. A comparison among them is performed on the SpamBase dataset to identify the best classification algorithm in terms of accuracy, computational time, and precision/recall rates.
Keywords
classification; content-based retrieval; information filtering; text analysis; unsolicited e-mail; SpamBase dataset; computational time; content-based classification algorithm; email filtering; filtering algorithms; financial loss; ham; spam emails; text classification; Accuracy; Artificial neural networks; Classification algorithms; Electronic mail; Support vector machines; Testing; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Mediterranean Electrotechnical Conference (MELECON), 2014 17th IEEE
Conference_Location
Beirut
Type
conf
DOI
10.1109/MELCON.2014.6820574
Filename
6820574
Link To Document