  • DocumentCode
    1273913
  • Title
    Speaker Diarization Error Analysis Using Oracle Components
  • Author
    Huijbregts, Marijn; Van Leeuwen, David A.; Wooters, Chuck
  • Author_Institution
    Centre for Language and Speech Technology, Radboud University Nijmegen, Nijmegen, Netherlands
  • Volume
    20
  • Issue
    2
  • fYear
    2012
  • Firstpage
    393
  • Lastpage
    403
  • Abstract
    In this paper, we describe an analysis of our speaker diarization system based on a series of oracle experiments. In this analysis, each system component is substituted by an oracle component that uses the reference transcripts to perform flawlessly. By placing the original components back into the system one at a time, either in a top-down or bottom-up manner, the performance of each individual system component is measured. The analysis approach can be applied to any speaker diarization system that consists of a concatenation of separate components. Our experimental findings are relevant for most RT09s diarization systems, because they all apply similar techniques. The analysis revealed that three components caused most errors: speech activity detection, the inability to handle overlapping speech, and robustness of the merging component to cluster impurity.
  • Keywords
    speech processing; RT09s diarization system; cluster impurity; oracle component; reference transcript; speaker diarization error analysis; speech activity detection; Data models; Density estimation robust algorithm; Error analysis; Hidden Markov models; Merging; NIST; Speech; Rich transcription; speaker diarization; system analysis
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type
    jour
  • DOI
    10.1109/TASL.2011.2162318
  • Filename
    5955080