• DocumentCode
    3300212
  • Title

    A fuzzy similarity approach for automated spam filtering

  • Author

    El-Alfy, El-Sayed M. ; Al-Qunaieer, Fares S.

  • Author_Institution
    King Fahd Univ. of Pet. & Miner., Dhahran
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    544
  • Lastpage
    550
  • Abstract
    E-mail spam has become an epidemic problem that can negatively affect the usability of electronic mail as a communication means. Besides wasting users´ time and effort to scan and delete the massive amount of junk e-mails received; it consumes network bandwidth and storage space, slows down e-mail servers, and provides a medium to distribute harmful and/or offensive content. Several machine learning approaches have been applied to this problem. In this paper, we explore a new approach based on fuzzy similarity that can automatically classify e-mail messages as spam or legitimate. We study its performance for various conjunction and disjunction operators for several datasets. The results are promising as compared with a naive Bayesian classifier. Classification accuracy above 97% and low false positive rates are achieved in many test cases.
  • Keywords
    filtering theory; learning (artificial intelligence); unsolicited e-mail; automated spam filtering; electronic mail spam; fuzzy similarity; machine learning; naive Bayesian classifier; Bandwidth; Bayesian methods; Costs; Electronic mail; Filtering; Filters; Machine learning; Network servers; Telecommunication traffic; Unsolicited electronic mail;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Systems and Applications, 2008. AICCSA 2008. IEEE/ACS International Conference on
  • Conference_Location
    Doha
  • Print_ISBN
    978-1-4244-1967-8
  • Electronic_ISBN
    978-1-4244-1968-5
  • Type

    conf

  • DOI
    10.1109/AICCSA.2008.4493585
  • Filename
    4493585