• DocumentCode
    143610
  • Title

    Ham or spam? A comparative study for some content-based classification algorithms for email filtering

  • Author

    Saab, Salwa Adriana ; Mitri, Nicholas ; Awad, Maher

  • Author_Institution
    Fac. of Electr. & Comput. Eng., American Univ. of Beirut, Beirut, Lebanon
  • fYear
    2014
  • fDate
    13-16 April 2014
  • Firstpage
    339
  • Lastpage
    343
  • Abstract
    Spam emails are widely spreading to constitute a significant share of everyone´s daily inbox. Being a source of financial loss and inconvenience for the recipients, spam emails have to be filtered and separated from legitimate ones. This paper presents a survey of some popular filtering algorithms that rely on text classification to decide whether an email is unsolicited or not. A comparison among them is performed on the SpamBase dataset to identify the best classification algorithm in terms of accuracy, computational time, and precision/recall rates.
  • Keywords
    classification; content-based retrieval; information filtering; text analysis; unsolicited e-mail; SpamBase dataset; computational time; content-based classification algorithm; email filtering; filtering algorithms; financial loss; ham; spam emails; text classification; Accuracy; Artificial neural networks; Classification algorithms; Electronic mail; Support vector machines; Testing; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Mediterranean Electrotechnical Conference (MELECON), 2014 17th IEEE
  • Conference_Location
    Beirut
  • Type

    conf

  • DOI
    10.1109/MELCON.2014.6820574
  • Filename
    6820574