DocumentCode
1273913
Title
Speaker Diarization Error Analysis Using Oracle Components
Author
Huijbregts, Marijn ; Van Leeuwen, David A. ; Wooters, Chuck
Author_Institution
Centre for Language & Speech Technol., Radboud Univ. Nijmegen, Nijmegen, Netherlands
Volume
20
Issue
2
fYear
2012
Firstpage
393
Lastpage
403
Abstract
In this paper, we describe an analysis of our speaker diarization system based on a series of oracle experiments. In this analysis, each system component is substituted by an oracle component that uses the reference transcripts to perform flawlessly. By placing the original components back into the system one at a time, in either a top-down or a bottom-up manner, the performance of each individual system component is measured. The analysis approach can be applied to any speaker diarization system that consists of a concatenation of separate components. Our experimental findings are relevant for most RT09s diarization systems, which all apply similar techniques. The analysis revealed that three factors caused most errors: speech activity detection, the inability to handle overlapping speech, and the lack of robustness of the merging component to cluster impurity.
Keywords
speech processing; RT09s diarization system; cluster impurity; oracle component; reference transcript; speaker diarization error analysis; speech activity detection; Data models; Density estimation robust algorithm; Error analysis; Hidden Markov models; Merging; NIST; Speech; Rich transcription; speaker diarization; system analysis
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
IEEE
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2011.2162318
Filename
5955080