Title :
Chinese Text Classification Based on the BVB Model
Author :
Liu, Rui ; Jiang, Minghu
Author_Institution :
Sch. of Humanities & Social Sci., Tsinghua Univ., Beijing, China
Abstract :
In this paper we take our effort to achieve a fast and accurate classifier: a BVB (BAM-Vote Box)-based framework is presented for text categorization by using ensemble method. The central idea is that combining two-class classifications for the multi-class tasks. This framework generates associating terms and extending the set of basis element, and includes a feature selection method, which can reduce the number of features and thus reduce computation and improve accuracy. A greedy algorithm is used in feature selection to overcome a serious bias of the unbalanced feature set. Then the new classifier, a BVB-based composite model, is introduced. Experimental results show that our approach can obtain high quality classifications and far less time-consuming than other neural networks, such as back propagation (BP) network and radial basis function (RBF) network, etc.
Keywords :
backpropagation; greedy algorithms; natural language processing; pattern classification; radial basis function networks; text analysis; BAM-Vote Box-based framework; BVB-based composite model; Chinese text classification; back propagation network; ensemble method; feature selection method; greedy algorithm; neural networks; radial basis function network; text categorization; Associative memory; Bismuth; Cognitive science; Computational linguistics; Computational modeling; Greedy algorithms; Magnesium compounds; Neural networks; Psychology; Text categorization; BVB Model; Text Classification;
Conference_Titel :
Semantics, Knowledge and Grid, 2008. SKG '08. Fourth International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-0-7695-3401-5
Electronic_ISBN :
978-0-7695-3401-5
DOI :
10.1109/SKG.2008.16