DocumentCode
2029545
Title
Chinese Text Classification Based on the BVB Model
Author
Liu, Rui ; Jiang, Minghu
Author_Institution
Sch. of Humanities & Social Sci., Tsinghua Univ., Beijing, China
fYear
2008
fDate
3-5 Dec. 2008
Firstpage
376
Lastpage
379
Abstract
In this paper we take our effort to achieve a fast and accurate classifier: a BVB (BAM-Vote Box)-based framework is presented for text categorization by using ensemble method. The central idea is that combining two-class classifications for the multi-class tasks. This framework generates associating terms and extending the set of basis element, and includes a feature selection method, which can reduce the number of features and thus reduce computation and improve accuracy. A greedy algorithm is used in feature selection to overcome a serious bias of the unbalanced feature set. Then the new classifier, a BVB-based composite model, is introduced. Experimental results show that our approach can obtain high quality classifications and far less time-consuming than other neural networks, such as back propagation (BP) network and radial basis function (RBF) network, etc.
Keywords
backpropagation; greedy algorithms; natural language processing; pattern classification; radial basis function networks; text analysis; BAM-Vote Box-based framework; BVB-based composite model; Chinese text classification; back propagation network; ensemble method; feature selection method; greedy algorithm; neural networks; radial basis function network; text categorization; Associative memory; Bismuth; Cognitive science; Computational linguistics; Computational modeling; Greedy algorithms; Magnesium compounds; Neural networks; Psychology; Text categorization; BVB Model; Text Classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Semantics, Knowledge and Grid, 2008. SKG '08. Fourth International Conference on
Conference_Location
Beijing
Print_ISBN
978-0-7695-3401-5
Electronic_ISBN
978-0-7695-3401-5
Type
conf
DOI
10.1109/SKG.2008.16
Filename
4725942
Link To Document