DocumentCode
2690809
Title
Integration of multiple annotators by aggregating experts and filtering novices
Author
Zhang, Ping ; Obradovic, Zoran
Author_Institution
Center for Data Analytics & Biomed. Inf., Temple Univ., Philadelphia, PA, USA
fYear
2012
fDate
4-7 Oct. 2012
Firstpage
1
Lastpage
6
Abstract
Learning from noisy labels obtained from multiple annotators and without access to any true labels is an increasingly important problem in bioinformatics and biomedicine. In our method, this challenge is addressed by iteratively filtering low-quality annotators and estimating the consensus labels based only on the remaining experts that provide higher-quality annotations. Experiments on biomedical text classification and CASP9 protein disorder prediction tasks provide evidence that the proposed algorithm is more accurate than the majority voting and previously developed multi-annotator approaches. The benefit of using the new method is particularly large when low-quality annotators dominate. Moreover, the new algorithm also suggests the most relevant annotators for each instance, thus paving the way for understanding the behaviors of each annotator and building more reliable predictive models for bioinformatics applications.
Keywords
bioinformatics; biological techniques; iterative methods; knowledge acquisition; molecular biophysics; proteins; text analysis; CASP9 protein disorder prediction tasks; aggregating experts; bioinformatics; biomedical text classification; biomedicine; consensus label estimation; data curation; filtering novices; high-quality annotations; iterative filter; low-quality annotators; multi-annotator approaches; multiple annotators; noisy labels; Accuracy; Classification algorithms; Estimation; Filtering; Noise measurement; Proteins; Sensitivity; crowdsourcing; data emotion; multiple noisy annotators; protein disorder prediction;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine (BIBM), 2012 IEEE International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
978-1-4673-2559-2
Electronic_ISBN
978-1-4673-2558-5
Type
conf
DOI
10.1109/BIBM.2012.6392657
Filename
6392657
Link To Document