DocumentCode :
2492992
Title :
A comparative study of machine learning techniques in blog comments spam filtering
Author :
Romero, C. ; Garcia Valdez, M. ; Alanis, A.
fYear :
2010
fDate :
18-23 July 2010
Firstpage :
1
Lastpage :
7
Abstract :
In this paper we compare four machine learning techniques for blog comments spam filtering. the machine learning techniques are the Naïve Bayes, K-nearest neighbor, neural networks and the support vector machines. For this comparative study we used a blog comment corpus that has been affected by spam, which is our study case in this work. We classify the comments of this blog comments corpus, which have 50 pages and 1024 blog comments are classified in spam an non-spam. The percentage of spam of this corpus is 67%.
Keywords :
Bayes methods; Web sites; information filtering; learning (artificial intelligence); neural nets; support vector machines; unsolicited e-mail; blog comments spam filtering; k-nearest neighbor; machine learning; naïve Bayes; neural networks; support vector machines; Classification algorithms; Information services; Internet; Machine learning; Neurons; Training; Web sites;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), The 2010 International Joint Conference on
Conference_Location :
Barcelona
ISSN :
1098-7576
Print_ISBN :
978-1-4244-6916-1
Type :
conf
DOI :
10.1109/IJCNN.2010.5596677
Filename :
5596677
Link To Document :
بازگشت