DocumentCode :
1273913
Title :
Speaker Diarization Error Analysis Using Oracle Components
Author :
Huijbregts, Marijn ; Van Leeuwen, David A. ; Wooters, Chuck
Author_Institution :
Centre for Language & Speech Technol., Radboud Univ. Nijmegen, Nijmegen, Netherlands
Volume :
20
Issue :
2
fYear :
2012
Firstpage :
393
Lastpage :
403
Abstract :
In this paper, we describe an analysis of our speaker diarization system based on a series of oracle experiments. In this analysis, each system component is substituted by an oracle component that uses the reference transcripts to perform flawlessly. By placing the original components back into the system one at a time, either in a top-down or bottom-up manner, the performance of each individual system component is measured. The analysis approach can be applied to any speaker diarization system that consists of a concatenation of separate components. Our experimental findings are relevant for most RT09s diarization systems that all apply similar techniques. The analysis revealed that three components caused most errors: speech activity detection, the inability to handle overlapping speech, and robustness of the merging component to cluster impurity.
Keywords :
speech processing; RT09s diarization system; cluster impurity; oracle component; reference transcript; speaker diarization error analysis; speech activity detection; Data models; Density estimation robust algorithm; Error analysis; Hidden Markov models; Merging; NIST; Speech; Rich transcription; speaker diarization; system analysis;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2011.2162318
Filename :
5955080
Link To Document :
بازگشت