• DocumentCode
    589164
  • Title
    Learning from Multiple Annotators: When Data is Hard and Annotators are Unreliable
  • Author
    Wolley, Chirine; Quafafou, Mohamed

  • Author_Institution
    LSIS, Aix-Marseille Univ., Marseille, France
  • fYear
    2012
  • fDate
    10 Dec. 2012
  • Firstpage
    514
  • Lastpage
    521
  • Abstract
    Crowdsourcing services have become popular, making it easy and fast to have datasets labeled by multiple annotators for supervised learning tasks. Unfortunately, in this context annotators are not reliable, as they may have different levels of experience or knowledge. Furthermore, the data to be labeled may also vary in their level of difficulty. How do we deal with data that are hard to label and annotators who are unreliable? In this paper, we present a probabilistic model for learning from multiple naive annotators, considering that annotators may decline to label an instance when they are unsure. Both the errors and the ignorance of annotators are integrated separately into the proposed Bayesian model. Experiments on several datasets show that our method achieves superior performance compared to other efficient learning algorithms.
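    The abstract describes a Bayesian treatment in which annotator error and annotator ignorance (declining to label) are modeled separately. The paper's actual model is not reproduced here; as a rough, generic illustration of the multiple-annotator setting it addresses, the sketch below runs a Dawid-Skene-style EM that estimates each annotator's accuracy from redundant binary labels, treating abstentions (None) as missing labels rather than as errors. The function name em_annotator_accuracy, the smoothing constants, and the toy data are illustrative assumptions, not part of the paper.

```python
# Minimal sketch (not the paper's model): Dawid-Skene-style EM for binary labels
# from multiple annotators, where an abstention (None) is a missing observation.
import numpy as np

def em_annotator_accuracy(labels, n_iter=50):
    """labels: list of rows, one per item, with entries in {0, 1, None} per annotator."""
    labels = np.array(labels, dtype=object)
    n_items, n_annot = labels.shape
    observed = np.array([[l is not None for l in row] for row in labels])
    votes = np.array([[l if l is not None else 0 for l in row] for row in labels], dtype=float)

    # Initialise label posteriors with a simple average over the observed votes.
    counts = observed.sum(axis=1)
    posterior = np.where(counts > 0, (votes * observed).sum(axis=1) / np.maximum(counts, 1), 0.5)
    accuracy = np.full(n_annot, 0.8)  # prior guess: annotators are better than chance

    for _ in range(n_iter):
        # M-step: expected fraction of correct labels per annotator,
        # counted only over items that annotator actually labelled (Laplace-smoothed).
        agree = posterior[:, None] * votes + (1 - posterior[:, None]) * (1 - votes)
        accuracy = ((agree * observed).sum(axis=0) + 1) / (observed.sum(axis=0) + 2)

        # E-step: posterior P(y_i = 1) under independent, symmetric annotator errors.
        log_p1 = np.zeros(n_items)
        log_p0 = np.zeros(n_items)
        for j in range(n_annot):
            mask = observed[:, j]
            v = votes[mask, j]
            log_p1[mask] += v * np.log(accuracy[j]) + (1 - v) * np.log(1 - accuracy[j])
            log_p0[mask] += (1 - v) * np.log(accuracy[j]) + v * np.log(1 - accuracy[j])
        posterior = 1.0 / (1.0 + np.exp(log_p0 - log_p1))

    return posterior, accuracy

# Toy usage: three annotators; the third abstains when unsure.
labels = [[1, 1, None],
          [0, 1, 0],
          [1, 1, 1],
          [0, 0, None]]
post, acc = em_annotator_accuracy(labels)
print(post.round(2), acc.round(2))
```

    Treating abstentions as missing observations is the simplest way to avoid counting a declined label as a mistake; the paper's model goes further by representing annotator ignorance explicitly alongside annotator error.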
  • Keywords
    Bayes methods; data mining; learning (artificial intelligence); Bayesian model; annotator errors; annotator ignorance; crowdsourcing services; dataset labelling; multiple naive annotator unreliability; probabilistic model; supervised learning algorithm; Approximation algorithms; Bayesian methods; Estimation; Probabilistic logic; Reliability; Supervised learning; Training; Bayesian Analysis; Crowdsourcing; Data Quality; Ignorance; Multiple Annotators
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Title
    2012 IEEE 12th International Conference on Data Mining Workshops (ICDMW)
  • Conference_Location
    Brussels
  • Print_ISBN
    978-1-4673-5164-5
  • Type
    conf
  • DOI
    10.1109/ICDMW.2012.48
  • Filename
    6406483