Title :
The SAIL speaker diarization system for analysis of spontaneous meetings
Author :
Han, Kyu J. ; Georgiou, Panayiotis G. ; Narayanan, Shrikanth S.
Author_Institution :
Ming Hsieh Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA
Abstract :
In this paper, we propose a novel approach to speaker diarization of spontaneous meetings in our own multimodal SmartRoom environment. The proposed speaker diarization system first applies a sequential clustering concept to segmentation of a given audio data source, and then performs agglomerative hierarchical clustering for speaker-specific classification (or speaker clustering) of speech segments. The speaker clustering algorithm utilizes an incremental Gaussian mixture cluster modeling strategy, and a stopping point estimation method based on information change rate. Through experiments on various meeting conversation data of approximately 200 minutes total length, this system is demonstrated to provide diarization error rate of 18.90% on average.
Keywords :
Gaussian distribution; pattern clustering; speech processing; SAIL speaker diarization system; Speech Analysis and Interpretation Laboratory; agglomerative hierarchical clustering; audio data source; incremental Gaussian mixture cluster modeling strategy; information change rate; multimodal SmartRoom environment; sequential clustering concept; speaker-specific classification; spontaneous meetings analysis; stopping point estimation method; Charge coupled devices; Charge-coupled image sensors; Clustering algorithms; Error analysis; Microphone arrays; Minutes; NIST; Smart cameras; Speech analysis; Viterbi algorithm;
Conference_Titel :
Multimedia Signal Processing, 2008 IEEE 10th Workshop on
Conference_Location :
Cairns, Qld
Print_ISBN :
978-1-4244-2294-4
Electronic_ISBN :
978-1-4244-2295-1
DOI :
10.1109/MMSP.2008.4665214