DocumentCode :
2770515
Title :
Efficient use of overlap information in speaker diarization
Author :
Otterson, Scott ; Ostendorf, Mari
Author_Institution :
Univ. of Washington, Seattle
fYear :
2007
fDate :
9-13 Dec. 2007
Firstpage :
683
Lastpage :
686
Abstract :
Speaker overlap in meetings is thought to be a significant contributor to error in speaker diarization, but it is not clear if overlaps are problematic for speaker clustering and/or if errors could be addressed by assigning multiple labels in overlap regions. In this paper, we look at these issues experimentally, assuming perfect detection of overlaps, to assess the relative importance of these problems and the potential impact of overlap detection. With our best features, we find that detecting overlaps could potentially improve diarization accuracy by 15% relative, using a simple strategy of assigning speaker labels in overlap regions according to the labels of the neighboring segments. In addition, the use of cross-correlation features with MFCCs reduces the performance gap due to overlaps, so that there is little gain from removing overlapped regions before clustering.
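The neighbor-based labeling mentioned in the abstract might be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not specify how neighbors are chosen, so the sketch assumes each overlap region receives the label of the nearest single-speaker segment ending at or before it and the nearest one starting at or after it. The function name `label_overlaps` and the tuple layouts are hypothetical.

```python
from typing import List, Set, Tuple

Segment = Tuple[float, float, str]  # (start, end, speaker label), assumed layout
Region = Tuple[float, float]        # (start, end) of a detected overlap


def label_overlaps(
    segments: List[Segment], overlaps: List[Region]
) -> List[Tuple[float, float, Set[str]]]:
    """Assign each overlap region the labels of its neighboring segments.

    Assumption: the "neighbors" are the closest segment that ends at or
    before the overlap and the closest segment that starts at or after it.
    """
    result = []
    for o_start, o_end in overlaps:
        # Candidate neighbors on each side of the overlap region.
        before = [s for s in segments if s[1] <= o_start]
        after = [s for s in segments if s[0] >= o_end]
        labels: Set[str] = set()
        if before:
            # Preceding neighbor: the segment with the latest end time.
            labels.add(max(before, key=lambda s: s[1])[2])
        if after:
            # Following neighbor: the segment with the earliest start time.
            labels.add(min(after, key=lambda s: s[0])[2])
        result.append((o_start, o_end, labels))
    return result


# Hypothetical example: speaker A until 4.0 s, speaker B from 5.0 s,
# with an overlap (assumed perfectly detected) in between.
example = label_overlaps([(0.0, 4.0, "A"), (5.0, 9.0, "B")], [(4.0, 5.0)])
```

In this example the overlap from 4.0 s to 5.0 s receives both labels, A and B. When both neighbors share a label, or a neighbor is missing at the start or end of a recording, the region ends up with a single label.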
Keywords :
speaker recognition; MFCC; overlap detection; overlap information; speaker clustering; speaker diarization; speaker identification; Error analysis; Independent component analysis; Microphones; NIST; Performance analysis; Performance gain; Source separation; Speech analysis; Speech processing; Testing; diarization; localization; overlap
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
Type :
conf
DOI :
10.1109/ASRU.2007.4430194
Filename :
4430194