• DocumentCode
    1694359
  • Title
    Comparing two methods for crowdsourcing speech transcription
  • Author
    Sprugnoli, Rachele ; Moretti, G. ; Fuoli, Matteo ; Giuliani, Diego ; Bentivogli, Luisa ; Pianta, Emanuele ; Gretter, Roberto ; Brugnara, Fabio
  • Author_Institution
    CELCT - Center for the Evaluation of Language & Commun. Technol., Povo, Italy
  • fYear
    2013
  • Firstpage
    8116
  • Lastpage
    8120
  • Abstract
    This paper presents the results of an experimental study comparing two methods for crowdsourcing speech transcription. The methods incorporate two different quality control mechanisms (explicit versus implicit) and are based on two different processes (parallel versus iterative). In the Gold Standard method, the same speech segment is transcribed in parallel by multiple contributors, whose reliability is checked against reference transcriptions provided by experts. In the Dual Pathway method, by contrast, two independent groups of contributors work on the same set of transcriptions, refining them iteratively until they converge, which eliminates the need for reference transcriptions and for a separate quality-checking phase. The two methods were tested on about half an hour of broadcast news speech in two European languages, German and Italian. Both methods obtained good results in terms of Word Error Rate (WER) and compared well with the word disagreement rate of experts on the same data.
  • Keywords
    natural language processing; outsourcing; quality control; speech processing; European languages; German; Italian; WER; crowdsourcing; dual pathway method; gold standard method; news speech; quality control mechanisms; reference transcriptions; speech transcription; word disagreement; word error rate; Conferences; Gold; Iterative methods; Quality control; Reliability; Speech; Standards; CrowdFlower; Crowdsourcing speech transcription; Mechanical Turk; automatic speech recognition;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2013.6639246
  • Filename
    6639246
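
The abstract reports results in terms of Word Error Rate (WER). For readers unfamiliar with the metric, here is a minimal sketch of the conventional definition: word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration. The record does not describe the paper's own scoring or text normalization, so this should not be read as the authors' procedure.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by reference length.

    Illustrative only: tokenization is plain whitespace splitting,
    with no case folding or punctuation normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)
```

Because insertions count as errors, the value can exceed 1.0 when the hypothesis is much longer than the reference.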