DocumentCode :
3316997
Title :
A robust unsupervised speaker clustering of speech utterances
Author :
Zhang, Shilei ; Zhang, Shuwu ; Xu, Bo
Author_Institution :
High Technol. Innovation Center, Chinese Acad. of Sci., Beijing, China
fYear :
2005
fDate :
30 Oct.-1 Nov. 2005
Firstpage :
115
Lastpage :
120
Abstract :
This paper aims at developing and investigating efficient, robust and unsupervised algorithm for speaker clustering. Each utterance is modeled as a single Gaussian model distribution. A novel distance metric is proposed in this paper for the purpose of determining stopping criteria. The advantage of the proposed method is that it achieves comparable performance without requiring an adjusting threshold term. In this paper, we adopt the framework of agglomerative hierarchical clustering (AHC) with the merging criterion using Kullback-Leibler (KL) distance. The proposed stopping criterion can ensure a right number of speaker clusters. The efficiency of the proposed algorithm is demonstrated with various experiments on data from NIST and HUB5, respectively.
Keywords :
Gaussian distribution; merging; pattern clustering; speaker recognition; unsupervised learning; Gaussian model distribution; agglomerative hierarchical clustering; merging criterion; speech utterances; unsupervised speaker clustering; Acoustic measurements; Availability; Clustering algorithms; Loudspeakers; Merging; NIST; Paper technology; Robustness; Speech; Technological innovation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
Type :
conf
DOI :
10.1109/NLPKE.2005.1598718
Filename :
1598718
Link To Document :
بازگشت