DocumentCode :
2690809
Title :
Integration of multiple annotators by aggregating experts and filtering novices
Author :
Zhang, Ping ; Obradovic, Zoran
Author_Institution :
Center for Data Analytics & Biomed. Inf., Temple Univ., Philadelphia, PA, USA
fYear :
2012
fDate :
4-7 Oct. 2012
Firstpage :
1
Lastpage :
6
Abstract :
Learning from noisy labels obtained from multiple annotators and without access to any true labels is an increasingly important problem in bioinformatics and biomedicine. In our method, this challenge is addressed by iteratively filtering low-quality annotators and estimating the consensus labels based only on the remaining experts that provide higher-quality annotations. Experiments on biomedical text classification and CASP9 protein disorder prediction tasks provide evidence that the proposed algorithm is more accurate than the majority voting and previously developed multi-annotator approaches. The benefit of using the new method is particularly large when low-quality annotators dominate. Moreover, the new algorithm also suggests the most relevant annotators for each instance, thus paving the way for understanding the behaviors of each annotator and building more reliable predictive models for bioinformatics applications.
Keywords :
bioinformatics; biological techniques; iterative methods; knowledge acquisition; molecular biophysics; proteins; text analysis; CASP9 protein disorder prediction tasks; aggregating experts; bioinformatics; biomedical text classification; biomedicine; consensus label estimation; data curation; filtering novices; high-quality annotations; iterative filter; low-quality annotators; multi-annotator approaches; multiple annotators; noisy labels; Accuracy; Classification algorithms; Estimation; Filtering; Noise measurement; Proteins; Sensitivity; crowdsourcing; data emotion; multiple noisy annotators; protein disorder prediction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2012 IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-1-4673-2559-2
Electronic_ISBN :
978-1-4673-2558-5
Type :
conf
DOI :
10.1109/BIBM.2012.6392657
Filename :
6392657
Link To Document :
بازگشت