DocumentCode :
3345798
Title :
The Optimization of Threshold-Based Naive Bayesian Algorithm
Author :
Wang Xin ; Jiang Hua
Author_Institution :
Sch. of Comput. Sci. & Control, Guilin Univ. of Electron. Technol., Guilin, China
fYear :
2009
fDate :
14-17 Oct. 2009
Firstpage :
762
Lastpage :
764
Abstract :
In order to realize the text classification and spam filtering, the Naive Bayesian algorithm estimate what class are the text in by basing on some statistical probability values in accordance with the characteristic in straining sample, but it is easy to expose the overflow problem, this article will optimize the algorithm by setting the threshold, the optimization strategy is comparing the times that the probability of each class exceed the threshold and the accumulated probability values at the same times. Compare with the existing method, experimental result show the new method not only can solve the overflow problem, but also improve the classification effect effectively.
Keywords :
Bayes methods; information filtering; probability; text analysis; unsolicited e-mail; accumulated probability values; optimization strategy; overflow problem; spam filtering; statistical probability values; text classification; threshold-based Naive Bayesian algorithm; Bayesian methods; Classification algorithms; Computer science; Electronic mail; Filtering algorithms; Information filtering; Information filters; Probability; Strain control; Text categorization; Naive Bayesian classification; information filtering; overflow; text classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Genetic and Evolutionary Computing, 2009. WGEC '09. 3rd International Conference on
Conference_Location :
Guilin
Print_ISBN :
978-0-7695-3899-0
Type :
conf
DOI :
10.1109/WGEC.2009.161
Filename :
5402821
Link To Document :
بازگشت