DocumentCode :
589259
Title :
Taxonomic Dimensionality Reduction in Bayesian Text Classification
Author :
McAllister, R. ; Sheppard, John
Author_Institution :
Dept. of Comput. Sci., Montana State Univ., Bozeman, MT, USA
Volume :
1
fYear :
2012
fDate :
12-15 Dec. 2012
Firstpage :
508
Lastpage :
513
Abstract :
Lexical abstraction hierarchies can be leveraged to provide semantic information that characterizes features of text corpora as a whole. This information may be used to determine the classification utility of the dimensions that describe a dataset. This paper presents a new method for preparing a dataset for probabilistic classification by determining, a priori, the utility of a very small subset of taxonomically-related dimensions via a Discriminative Multinomial Naive Bayes process. We show that this method yields significant improvements over both Discriminative Multinomial Naive Bayes and Bayesian network classifiers alone.
Keywords :
belief networks; classification; statistical distributions; text analysis; Bayesian network classifier; Bayesian text classification; classification utility; discriminative multinomial naive Bayes process; lexical abstraction hierarchies; probabilistic classification; probability distribution; semantic information; taxonomic dimensionality reduction; text corpora; Bayesian methods; Hardware; Indexes; Microphones; Semantics; Taxonomy; Time division multiplexing; Bayesian Networks; WordNet; abstraction; classification; dimensionality reduction; lexicography; text;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications (ICMLA), 2012 11th International Conference on
Conference_Location :
Boca Raton, FL
Print_ISBN :
978-1-4673-4651-1
Type :
conf
DOI :
10.1109/ICMLA.2012.93
Filename :
6406664
Link To Document :
بازگشت