Multidimensional scaling for fast speaker clustering

Author

Hsia, Chi-Chun ; Lee, Kuo-Yuan ; Chuang, Chih-Chieh ; Chiu, Yu-Hsien

Author_Institution

ICT-Enabled Healthcare Program, Ind. Technol. Res. Inst. - South, Tainan, Taiwan

fYear

2010

fDate

Nov. 29 2010-Dec. 3 2010

Firstpage

296

Lastpage

299

Abstract

This study presents a fast speaker clustering method based on multidimensional scaling. Speech segments are trained as initial acoustic models. MDS is utilized to transform acoustic models to a space with the coordinate best preserve the distances or dissimilarity between models. Speaker clusters are clustered using vector quantization on the MDS coordinates and the acoustic speaker models are trained on MFCCs features for each cluster. Experimental results show the proposed method outperforms the baseline speaker clustering method in lower execution time.

Keywords

pattern clustering; speaker recognition; acoustic speaker models; baseline speaker clustering method; fast speaker clustering; initial acoustic models; multidimensional scaling; speech segments; vector quantization; Acoustics; Conferences; Density estimation robust algorithm; Feature extraction; Matrix decomposition; Speech; Transforms; multidimensional scaling; speaker clustering; speaker models;

fLanguage

English

Publisher

ieee

Conference_Titel

Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on

Conference_Location

Tainan

Print_ISBN

978-1-4244-6244-5

Type

conf

DOI

10.1109/ISCSLP.2010.5684888

Filename

5684888

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=2017864