• DocumentCode
    2017864
  • Title

    Multidimensional scaling for fast speaker clustering

  • Author

    Hsia, Chi-Chun ; Lee, Kuo-Yuan ; Chuang, Chih-Chieh ; Chiu, Yu-Hsien

  • Author_Institution
    ICT-Enabled Healthcare Program, Ind. Technol. Res. Inst. - South, Tainan, Taiwan
  • fYear
    2010
  • fDate
    Nov. 29 2010-Dec. 3 2010
  • Firstpage
    296
  • Lastpage
    299
  • Abstract
    This study presents a fast speaker clustering method based on multidimensional scaling. Speech segments are trained as initial acoustic models. MDS is utilized to transform acoustic models to a space with the coordinate best preserve the distances or dissimilarity between models. Speaker clusters are clustered using vector quantization on the MDS coordinates and the acoustic speaker models are trained on MFCCs features for each cluster. Experimental results show the proposed method outperforms the baseline speaker clustering method in lower execution time.
  • Keywords
    pattern clustering; speaker recognition; acoustic speaker models; baseline speaker clustering method; fast speaker clustering; initial acoustic models; multidimensional scaling; speech segments; vector quantization; Acoustics; Conferences; Density estimation robust algorithm; Feature extraction; Matrix decomposition; Speech; Transforms; multidimensional scaling; speaker clustering; speaker models;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
  • Conference_Location
    Tainan
  • Print_ISBN
    978-1-4244-6244-5
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2010.5684888
  • Filename
    5684888