DocumentCode :
2891676
Title :
Cost-Sensitive Multi-strategy Active Annotation for Text Classification
Author :
Arora, Samarth ; Donmez, Pinar ; Nyberg, E.
Author_Institution :
Language Technol. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
1
fYear :
2012
fDate :
12-15 Dec. 2012
Firstpage :
421
Lastpage :
426
Abstract :
Different parts of an instance may be strong or weak indicators of the instance´s label. We propose a new annotation strategy, where in addition to an instance´s label, the annotator indicates parts of the instance that are rationales for its label. For two text classification tasks, we show that rationales provide a significant improvement in performance. Each instance (with or without rationales) may provide different incremental value to the learning algorithm. Annotation cost may also vary across instances and annotation strategies. We propose a cost-sensitive active learning approach for joint selection of instance and strategy that automatically determines which instances to query and whether to ask for rationales. We show that the proposed approach outperforms instance selection with a fixed strategy. While the additional cost for rationales may vary across annotators, user interface design, task, etc. We show that the proposed approach is able to select the right annotation strategy for each scenario.
Keywords :
learning (artificial intelligence); pattern classification; query processing; text analysis; annotation strategy selection; annotator rationales; cost-sensitive active learning approach; cost-sensitive multistrategy active annotation; incremental value; instance querrying; instance selection; performance improvement; text classification; Accuracy; Computational modeling; Correlation; Data models; Joints; Training; Uncertainty; Active learning with Multiple Strategies; Annotator Rationales; Cost-Sensitive Active Learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications (ICMLA), 2012 11th International Conference on
Conference_Location :
Boca Raton, FL
Print_ISBN :
978-1-4673-4651-1
Type :
conf
DOI :
10.1109/ICMLA.2012.76
Filename :
6406699
Link To Document :
بازگشت