DocumentCode
1932497
Title
A comparison of several ensemble methods for text categorization
Author
Dong, Yan-Shi ; Han, Ke-Song
Author_Institution
Shanghai Jiao Tong Univ., China
fYear
2004
fDate
15-18 Sept. 2004
Firstpage
419
Lastpage
422
Abstract
Text categorization (TC), as an important domain of machine learning, has many unique traits, such as huge number of features, serious redundant features, dataset imbalance, etc. In this paper the various ensemble methods of naive Bayes classifiers and SVM classifiers are experimentally compared on the TC tasks. Besides, a new type of classifiers, moderated asymmetric naive Bayes classifiers, is proposed. Its advantages over the conventional naive Bayes classifiers in performance and computational efficiency are demonstrated.
Keywords
belief networks; learning (artificial intelligence); pattern classification; support vector machines; text analysis; Bayes classifier; machine learning; support vector machine classifier; text categorization; Bagging; Boosting; Computational efficiency; Machine learning; Neural networks; Niobium; Stacking; Support vector machine classification; Support vector machines; Text categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Services Computing, 2004. (SCC 2004). Proceedings. 2004 IEEE International Conference on
Print_ISBN
0-7695-2225-4
Type
conf
DOI
10.1109/SCC.2004.1358033
Filename
1358033
Link To Document