DocumentCode :
2017864
Title :
Multidimensional scaling for fast speaker clustering
Author :
Hsia, Chi-Chun ; Lee, Kuo-Yuan ; Chuang, Chih-Chieh ; Chiu, Yu-Hsien
Author_Institution :
ICT-Enabled Healthcare Program, Ind. Technol. Res. Inst. - South, Tainan, Taiwan
fYear :
2010
fDate :
Nov. 29 2010-Dec. 3 2010
Firstpage :
296
Lastpage :
299
Abstract :
This study presents a fast speaker clustering method based on multidimensional scaling. Speech segments are trained as initial acoustic models. MDS is utilized to transform acoustic models to a space with the coordinate best preserve the distances or dissimilarity between models. Speaker clusters are clustered using vector quantization on the MDS coordinates and the acoustic speaker models are trained on MFCCs features for each cluster. Experimental results show the proposed method outperforms the baseline speaker clustering method in lower execution time.
Keywords :
pattern clustering; speaker recognition; acoustic speaker models; baseline speaker clustering method; fast speaker clustering; initial acoustic models; multidimensional scaling; speech segments; vector quantization; Acoustics; Conferences; Density estimation robust algorithm; Feature extraction; Matrix decomposition; Speech; Transforms; multidimensional scaling; speaker clustering; speaker models;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
Type :
conf
DOI :
10.1109/ISCSLP.2010.5684888
Filename :
5684888
Link To Document :
بازگشت