Title :
Multidimensional scaling for fast speaker clustering
Author :
Hsia, Chi-Chun ; Lee, Kuo-Yuan ; Chuang, Chih-Chieh ; Chiu, Yu-Hsien
Author_Institution :
ICT-Enabled Healthcare Program, Ind. Technol. Res. Inst. - South, Tainan, Taiwan
fDate :
Nov. 29 2010-Dec. 3 2010
Abstract :
This study presents a fast speaker clustering method based on multidimensional scaling. Speech segments are trained as initial acoustic models. MDS is utilized to transform acoustic models to a space with the coordinate best preserve the distances or dissimilarity between models. Speaker clusters are clustered using vector quantization on the MDS coordinates and the acoustic speaker models are trained on MFCCs features for each cluster. Experimental results show the proposed method outperforms the baseline speaker clustering method in lower execution time.
Keywords :
pattern clustering; speaker recognition; acoustic speaker models; baseline speaker clustering method; fast speaker clustering; initial acoustic models; multidimensional scaling; speech segments; vector quantization; Acoustics; Conferences; Density estimation robust algorithm; Feature extraction; Matrix decomposition; Speech; Transforms; multidimensional scaling; speaker clustering; speaker models;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
DOI :
10.1109/ISCSLP.2010.5684888