• DocumentCode
    2608677
  • Title

    Directly Modeling of Correlation Matrices for GMM in Speaker Identification

  • Author

    Yao, Zhiqiang ; Zhou, Xi ; Dai, Beiqian ; Liu, Minghui ; Xie, Yanlu

  • Author_Institution
    MOE-Microsoft Key Lab. of Multimedia Comput. & Commun., Univ. of Sci. & Technol. of China, Beijing
  • Volume
    4
  • fYear
    0
  • fDate
    0-0 0
  • Firstpage
    306
  • Lastpage
    309
  • Abstract
    In this paper, we present a new framework to model full covariance matrices of Gaussian components. In this framework, directly modeling the full correlation matrix instead of the full covariance matrix is our purpose, as the correlation matrix is the direct description of the correlation of inter-feature elements. In order to model full correlation matrices, we share linear transformations among components´ full correlation matrices. Thus, the full correlation matrix of each component is represented by a shared linear transformation and a component-specific diagonal correlation matrix. The transformation is used to help the diagonal correlation matrix to model the correlation of inter feature-vector elements more precisely. We evaluate our new framework on a Mandarin speaker identification task. Experiments show that above 35% reduction in speaker identification error rate is achieved compared with the best diagonal covariance models. Furthermore, our algorithm achieved better performance than STC does
  • Keywords
    Gaussian processes; covariance matrices; natural languages; speaker recognition; Guassian mixture model; Mandarin speaker identification; component-specific diagonal correlation matrix; covariance matrices; shared linear transformation; Clustering algorithms; Covariance matrix; Error analysis; Laboratories; Maximum likelihood estimation; Multimedia computing; Speaker recognition; Speech recognition; Unsolicited electronic mail; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1051-4651
  • Print_ISBN
    0-7695-2521-0
  • Type

    conf

  • DOI
    10.1109/ICPR.2006.453
  • Filename
    1699841