Title :
Speaker Diarization Error Analysis Using Oracle Components
Author :
Huijbregts, Marijn ; Van Leeuwen, David A. ; Wooters, Chuck
Author_Institution :
Centre for Language & Speech Technol., Radboud Univ. Nijmegen, Nijmegen, Netherlands
Abstract :
In this paper, we describe an analysis of our speaker diarization system based on a series of oracle experiments. In this analysis, each system component is substituted by an oracle component that uses the reference transcripts to perform flawlessly. By placing the original components back into the system one at a time, either in a top-down or bottom-up manner, the performance of each individual system component is measured. The analysis approach can be applied to any speaker diarization system that consists of a concatenation of separate components. Our experimental findings are relevant for most RT09s diarization systems that all apply similar techniques. The analysis revealed that three components caused most errors: speech activity detection, the inability to handle overlapping speech, and robustness of the merging component to cluster impurity.
Keywords :
speech processing; RT09s diarization system; cluster impurity; oracle component; reference transcript; speaker diarization error analysis; speech activity detection; Data models; Density estimation robust algorithm; Error analysis; Hidden Markov models; Merging; NIST; Speech; Rich transcription; speaker diarization; system analysis;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2011.2162318