• DocumentCode
    3528811
  • Title

    A GMM supervector Kernel with the Bhattacharyya distance for SVM based speaker recognition

  • Author

    You, Chang Huai ; Lee, Kong Aik ; Li, Haizhou

  • Author_Institution
    Inst. for Infocomm Res. (I2R), A*STAR, Singapore
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4221
  • Lastpage
    4224
  • Abstract
    Gaussian mixture model (GMM) supervector is one of the effective techniques in text independent speaker recognition. In our previous work, we introduce the GMM-UBM mean interval (GUMI) concept based on the Bhattacharyya distance. Subsequently GUMI kernel was successfully used in conjunction with support vector machine (SVM) for speaker recognition. Besides the first order statistics, it is generally believed that speaker cues are also partly conveyed by second order statistics. In this paper, we extend the Bhattacharyya-based SVM kernel by constructing the supervector with the mean statistical vector and the covariance statistical vector. Comparing with the Kullback-Leibler (KL) kernel, we demonstrate the effectiveness of the new kernel on the 2006 National Institute of Standards and Technology (NIST) speaker recognition evaluation (SRE) dataset.
  • Keywords
    Gaussian processes; covariance analysis; speaker recognition; support vector machines; Bhattacharyya distance; GMM supervector kernel; GMM-UBM mean interval; GUMI kernel; Gaussian mixture model; Kullback-Leibler kernel; covariance statistical vector; mean statistical vector; support vector machine; text independent speaker recognition; Cost function; Distance measurement; Error analysis; Kernel; NIST; Speaker recognition; Speech; Statistics; Support vector machines; Testing; Gaussian Mixture Model; NIST Evaluation; Speaker Verification; Supervector; Support Vector Machine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960560
  • Filename
    4960560