• DocumentCode
    2492992
  • Title

    A comparative study of machine learning techniques in blog comments spam filtering

  • Author

    Romero, C. ; Garcia Valdez, M. ; Alanis, A.

  • fYear
    2010
  • fDate
    18-23 July 2010
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    In this paper we compare four machine learning techniques for blog comments spam filtering. the machine learning techniques are the Naïve Bayes, K-nearest neighbor, neural networks and the support vector machines. For this comparative study we used a blog comment corpus that has been affected by spam, which is our study case in this work. We classify the comments of this blog comments corpus, which have 50 pages and 1024 blog comments are classified in spam an non-spam. The percentage of spam of this corpus is 67%.
  • Keywords
    Bayes methods; Web sites; information filtering; learning (artificial intelligence); neural nets; support vector machines; unsolicited e-mail; blog comments spam filtering; k-nearest neighbor; machine learning; naïve Bayes; neural networks; support vector machines; Classification algorithms; Information services; Internet; Machine learning; Neurons; Training; Web sites;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2010 International Joint Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1098-7576
  • Print_ISBN
    978-1-4244-6916-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2010.5596677
  • Filename
    5596677