Title :
Batch Mode Active Learning Based Multi-view Text Classification
Author :
Zhang, Xua ; Zhao, Dong Yan ; Chen, Li Wei ; Min, Wang Hua
Author_Institution :
ICST, Peking Univ., Beijing, China
Abstract :
The goal of active learning is to select the most informative examples for manual labeling in order to reduce the effort involved in acquiring labeled examples, which is very important for large-scale text classification. However, most of the previous studies in active learning have focused on selecting a single unlabeled example at a time which could be inefficient since the model has to be retrained for every new labeled example. In this paper we propose a novel simple batch mode active learning(BMAL) method based on farthest-first traversal to select a number of informative examples for labeling simultaneously in each iteration. Furthermore, we combine the BMAL with a multi-view framework in order to improve its execution efficiency. The k nearest neighbor(kNN) model is used as the baseline classifier for its simplicity and efficiency. Extensive experiments on standard dataset have shown that our algorithm is more effective than the single mode counterpart and the baseline classifier.
Keywords :
learning (artificial intelligence); pattern classification; text analysis; batch mode active learning; farthest-first traversal method; k nearest neighbor method; multiview text classification; Fuzzy systems; Iterative algorithms; Labeling; Large-scale systems; Learning systems; Machine learning; Measurement uncertainty; Semisupervised learning; Testing; Text categorization; Batch Mode Active Learning; Multi-View Learning; k nearest neighbor(kNN); text classification;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3735-1
DOI :
10.1109/FSKD.2009.495