DocumentCode
2017864
Title
Multidimensional scaling for fast speaker clustering
Author
Hsia, Chi-Chun ; Lee, Kuo-Yuan ; Chuang, Chih-Chieh ; Chiu, Yu-Hsien
Author_Institution
ICT-Enabled Healthcare Program, Ind. Technol. Res. Inst. - South, Tainan, Taiwan
fYear
2010
fDate
Nov. 29 2010-Dec. 3 2010
Firstpage
296
Lastpage
299
Abstract
This study presents a fast speaker clustering method based on multidimensional scaling. Speech segments are trained as initial acoustic models. MDS is utilized to transform acoustic models to a space with the coordinate best preserve the distances or dissimilarity between models. Speaker clusters are clustered using vector quantization on the MDS coordinates and the acoustic speaker models are trained on MFCCs features for each cluster. Experimental results show the proposed method outperforms the baseline speaker clustering method in lower execution time.
Keywords
pattern clustering; speaker recognition; acoustic speaker models; baseline speaker clustering method; fast speaker clustering; initial acoustic models; multidimensional scaling; speech segments; vector quantization; Acoustics; Conferences; Density estimation robust algorithm; Feature extraction; Matrix decomposition; Speech; Transforms; multidimensional scaling; speaker clustering; speaker models;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location
Tainan
Print_ISBN
978-1-4244-6244-5
Type
conf
DOI
10.1109/ISCSLP.2010.5684888
Filename
5684888
Link To Document