• DocumentCode
    2029545
  • Title

    Chinese Text Classification Based on the BVB Model

  • Author

    Liu, Rui ; Jiang, Minghu

  • Author_Institution
    Sch. of Humanities & Social Sci., Tsinghua Univ., Beijing, China
  • fYear
    2008
  • fDate
    3-5 Dec. 2008
  • Firstpage
    376
  • Lastpage
    379
  • Abstract
    In this paper we take our effort to achieve a fast and accurate classifier: a BVB (BAM-Vote Box)-based framework is presented for text categorization by using ensemble method. The central idea is that combining two-class classifications for the multi-class tasks. This framework generates associating terms and extending the set of basis element, and includes a feature selection method, which can reduce the number of features and thus reduce computation and improve accuracy. A greedy algorithm is used in feature selection to overcome a serious bias of the unbalanced feature set. Then the new classifier, a BVB-based composite model, is introduced. Experimental results show that our approach can obtain high quality classifications and far less time-consuming than other neural networks, such as back propagation (BP) network and radial basis function (RBF) network, etc.
  • Keywords
    backpropagation; greedy algorithms; natural language processing; pattern classification; radial basis function networks; text analysis; BAM-Vote Box-based framework; BVB-based composite model; Chinese text classification; back propagation network; ensemble method; feature selection method; greedy algorithm; neural networks; radial basis function network; text categorization; Associative memory; Bismuth; Cognitive science; Computational linguistics; Computational modeling; Greedy algorithms; Magnesium compounds; Neural networks; Psychology; Text categorization; BVB Model; Text Classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantics, Knowledge and Grid, 2008. SKG '08. Fourth International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-0-7695-3401-5
  • Electronic_ISBN
    978-0-7695-3401-5
  • Type

    conf

  • DOI
    10.1109/SKG.2008.16
  • Filename
    4725942