DocumentCode :
2772433
Title :
Classification of text documents
Author :
Li, Yonghong ; Jain, Anil K.
Author_Institution :
Dept. of Comput. Sci., Michigan State Univ., East Lansing, MI, USA
Volume :
2
fYear :
1998
fDate :
16-20 Aug 1998
Firstpage :
1295
Abstract :
We investigate four different classification methods for document classification: the naive Bayes classifier, nearest neighbor classifier, decision tree classifier, and subspace method. The classifiers were applied to seven-class Yahoo newsgroups individually and in combination. We study three classifier combination approaches: simple voting, dynamic classifier selection, and adaptive classifier combination. Our experimental results indicate that the naive Bayes classifier and the subspace method outperform the other two classification methods on our data sets. Combinations of multiple classifiers did not always improve classification accuracy. Among the three different combination approaches, the adaptive classifier combination method proposed here performed the best
Keywords :
Bayes methods; classification; decision trees; document handling; pattern classification; Yahoo newsgroups; adaptive classifier; decision tree classifier; dynamic classifier selection; naive Bayes classifier; nearest neighbor classifier; subspace method; text document classification; voting; Classification tree analysis; Computer science; Decision trees; Frequency; Humans; Linear discriminant analysis; Nearest neighbor searches; Space technology; Voting; Web sites;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 1998. Proceedings. Fourteenth International Conference on
Conference_Location :
Brisbane, Qld.
ISSN :
1051-4651
Print_ISBN :
0-8186-8512-3
Type :
conf
DOI :
10.1109/ICPR.1998.711938
Filename :
711938
Link To Document :
بازگشت