DocumentCode
2492992
Title
A comparative study of machine learning techniques in blog comments spam filtering
Author
Romero, C. ; Garcia Valdez, M. ; Alanis, A.
fYear
2010
fDate
18-23 July 2010
Firstpage
1
Lastpage
7
Abstract
In this paper we compare four machine learning techniques for blog comments spam filtering. the machine learning techniques are the Naïve Bayes, K-nearest neighbor, neural networks and the support vector machines. For this comparative study we used a blog comment corpus that has been affected by spam, which is our study case in this work. We classify the comments of this blog comments corpus, which have 50 pages and 1024 blog comments are classified in spam an non-spam. The percentage of spam of this corpus is 67%.
Keywords
Bayes methods; Web sites; information filtering; learning (artificial intelligence); neural nets; support vector machines; unsolicited e-mail; blog comments spam filtering; k-nearest neighbor; machine learning; naïve Bayes; neural networks; support vector machines; Classification algorithms; Information services; Internet; Machine learning; Neurons; Training; Web sites;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks (IJCNN), The 2010 International Joint Conference on
Conference_Location
Barcelona
ISSN
1098-7576
Print_ISBN
978-1-4244-6916-1
Type
conf
DOI
10.1109/IJCNN.2010.5596677
Filename
5596677
Link To Document