DocumentCode
2891676
Title
Cost-Sensitive Multi-strategy Active Annotation for Text Classification
Author
Arora, Samarth ; Donmez, Pinar ; Nyberg, E.
Author_Institution
Language Technol. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume
1
fYear
2012
fDate
12-15 Dec. 2012
Firstpage
421
Lastpage
426
Abstract
Different parts of an instance may be strong or weak indicators of the instance´s label. We propose a new annotation strategy, where in addition to an instance´s label, the annotator indicates parts of the instance that are rationales for its label. For two text classification tasks, we show that rationales provide a significant improvement in performance. Each instance (with or without rationales) may provide different incremental value to the learning algorithm. Annotation cost may also vary across instances and annotation strategies. We propose a cost-sensitive active learning approach for joint selection of instance and strategy that automatically determines which instances to query and whether to ask for rationales. We show that the proposed approach outperforms instance selection with a fixed strategy. While the additional cost for rationales may vary across annotators, user interface design, task, etc. We show that the proposed approach is able to select the right annotation strategy for each scenario.
Keywords
learning (artificial intelligence); pattern classification; query processing; text analysis; annotation strategy selection; annotator rationales; cost-sensitive active learning approach; cost-sensitive multistrategy active annotation; incremental value; instance querrying; instance selection; performance improvement; text classification; Accuracy; Computational modeling; Correlation; Data models; Joints; Training; Uncertainty; Active learning with Multiple Strategies; Annotator Rationales; Cost-Sensitive Active Learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Applications (ICMLA), 2012 11th International Conference on
Conference_Location
Boca Raton, FL
Print_ISBN
978-1-4673-4651-1
Type
conf
DOI
10.1109/ICMLA.2012.76
Filename
6406699
Link To Document