DocumentCode
2710201
Title
An Effective Algorithm for Improving the Performance of Naive Bayes for Text Classification
Author
GuoQiang
Author_Institution
Higher Vocational Coll., Shanghai Univ. Of Eng. Sci., Shanghai, China
fYear
2010
fDate
7-10 May 2010
Firstpage
699
Lastpage
701
Abstract
Naive Bayes algorithm is uncomplicated and effective in text classification and experiments. However, its performance is often imperfect because it does not model text well, and by inappropriate feature selection and some disadvantages of the Naive Bayes itself. This paper makes some modifications for Naive Bayes to improve the performance of Naive Bayes and the effect, condition as well, on categorization. Finally, the paper adopts this algorithm in Spam Filter categorization, a quite typical text classification. Some experiments were done with this method; results were compared with its previous method.
Keywords
Bayes methods; learning (artificial intelligence); pattern classification; text analysis; feature selection; naive Bayes algorithm; spam filter categorization; text classification; Application software; Educational institutions; Electronic mail; Frequency; High performance computing; Information filtering; Information filters; Machine learning; Research and development; Text categorization; component; machine learning; naive Bayes classification; text classification; words frequencies;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Research and Development, 2010 Second International Conference on
Conference_Location
Kuala Lumpur
Print_ISBN
978-0-7695-4043-6
Type
conf
DOI
10.1109/ICCRD.2010.160
Filename
5489530
Link To Document