DocumentCode
2770515
Title
Efficient use of overlap information in speaker diarization
Author
Otterson, Scott ; Ostendorf, Mari
Author_Institution
Univ. of Washington, Seattle
fYear
2007
fDate
9-13 Dec. 2007
Firstpage
683
Lastpage
686
Abstract
Speaker overlap in meetings is thought to be a significant contributor to error in speaker diarization, but it is not clear if overlaps are problematic for speaker clustering and/or if errors could be addressed by assigning multiple labels in overlap regions. In this paper, we look at these issues experimentally, assuming perfect detection of overlaps, to assess the relative importance of these problems and the potential impact of overlap detection. With our best features, we find that detecting overlaps could potentially improve diarization accuracy by 15% relative, using a simple strategy of assigning speaker labels in overlap regions according to the labels of the neighboring segments. In addition, the use of cross-correlation features with MFCCs reduces the performance gap due to overlaps, so that there is little gain from removing overlapped regions before clustering.
Keywords
speaker recognition; MFCC; overlap detection; overlap information; speaker clustering; speaker diarization; speaker identification; Error analysis; Independent component analysis; Microphones; NIST; Performance analysis; Performance gain; Source separation; Speech analysis; Speech processing; Testing; diarization; localization; overlap
fLanguage
English
Publisher
IEEE
Conference_Titel
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location
Kyoto
Print_ISBN
978-1-4244-1746-9
Electronic_ISBN
978-1-4244-1746-9
Type
conf
DOI
10.1109/ASRU.2007.4430194
Filename
4430194