DocumentCode
2337446
Title
Utilizing improved Bayesian algorithm to identify blog comment spam
Author
Aiwu, Li ; Hongying, Liu
Author_Institution
Dept. of Comput. Sci., Guangdong Vocational Coll. of Posts & Telecom, Guangzhou, China
fYear
2012
fDate
3-5 June 2012
Firstpage
423
Lastpage
426
Abstract
In this paper, according to the blog website team dealing with comment spam demand more and more, analyzed the traditional Bayesian algorithm based on statistical method of defects, pointed out the deficiency in practical application, improved the rough Bayesian algorithm, utilized a string of comments appear based on the garbage probability value, calculated the number of geometrical average algorithm. The experimental results show that the modified Bayesian classification algorithm can effectively improve the classification of spam effect, garbage afr, legal review comments afr and average afr has dropped substantially.
Keywords
Bayes methods; Web sites; pattern classification; security of data; statistical analysis; Bayesian classification algorithm; average AFR classification; blog Website team; blog comment spam identification; comment spam; defects statistical method; garbage AFR classification; garbage probability value; geometrical average algorithm; legal review comments AFR classification; spam effect classification; Algorithm design and analysis; Bayesian methods; Blogs; Classification algorithms; Libraries; Unsolicited electronic mail; Bayesian algorithm; blog comment spam; geometric mean algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Robotics and Applications (ISRA), 2012 IEEE Symposium on
Conference_Location
Kuala Lumpur
Print_ISBN
978-1-4673-2205-8
Type
conf
DOI
10.1109/ISRA.2012.6219215
Filename
6219215
Link To Document