DocumentCode :
3724149
Title :
Quality Control for Crowdsourced Hierarchical Classification
Author :
Naoki Otani;Yukino Baba;Hisashi Kashima
Author_Institution :
Kyoto Univ., Kyoto, Japan
fYear :
2015
Firstpage :
937
Lastpage :
942
Abstract :
Repeated labeling is a widely adopted quality control method in crowdsourcing. This method selects one reliable label from multiple labels collected from workers, because a single label from only one worker varies widely in accuracy. Hierarchical classification, in which the classes are organized in a hierarchy, is a typical task in crowdsourcing. However, directly applying existing methods designed for multi-class classification has the disadvantage of having to discriminate among a large number of classes. In this paper, we propose a label aggregation method for hierarchical classification tasks. Our method takes the hierarchical structure into account to handle a large number of classes and estimate worker abilities more precisely. Our method is inspired by the steps model based on item response theory, which models responses of examinees to sequentially dependent questions. We regard hierarchical classification as a question consisting of a sequence of subquestions and build a worker response model for hierarchical classification. We conducted experiments using real crowdsourced hierarchical classification tasks and demonstrated the benefit of incorporating a hierarchical structure to improve the label aggregation accuracy.
Keywords :
"Labeling","Crowdsourcing","Probabilistic logic","Pharmaceuticals","Reliability","Electronic mail","Quality control"
Publisher :
IEEE
Conference_Title :
Data Mining (ICDM), 2015 IEEE International Conference on
ISSN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2015.83
Filename :
7373415