• DocumentCode
    2767877
  • Title

    Batch Mode Active Learning Based Multi-view Text Classification

  • Author

    Zhang, Xua ; Zhao, Dong Yan ; Chen, Li Wei ; Min, Wang Hua

  • Author_Institution
    ICST, Peking Univ., Beijing, China
  • Volume
    7
  • fYear
    2009
  • fDate
    14-16 Aug. 2009
  • Firstpage
    472
  • Lastpage
    476
  • Abstract
    The goal of active learning is to select the most informative examples for manual labeling in order to reduce the effort involved in acquiring labeled examples, which is very important for large-scale text classification. However, most of the previous studies in active learning have focused on selecting a single unlabeled example at a time which could be inefficient since the model has to be retrained for every new labeled example. In this paper we propose a novel simple batch mode active learning(BMAL) method based on farthest-first traversal to select a number of informative examples for labeling simultaneously in each iteration. Furthermore, we combine the BMAL with a multi-view framework in order to improve its execution efficiency. The k nearest neighbor(kNN) model is used as the baseline classifier for its simplicity and efficiency. Extensive experiments on standard dataset have shown that our algorithm is more effective than the single mode counterpart and the baseline classifier.
  • Keywords
    learning (artificial intelligence); pattern classification; text analysis; batch mode active learning; farthest-first traversal method; k nearest neighbor method; multiview text classification; Fuzzy systems; Iterative algorithms; Labeling; Large-scale systems; Learning systems; Machine learning; Measurement uncertainty; Semisupervised learning; Testing; Text categorization; Batch Mode Active Learning; Multi-View Learning; k nearest neighbor(kNN); text classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-0-7695-3735-1
  • Type

    conf

  • DOI
    10.1109/FSKD.2009.495
  • Filename
    5360055