DocumentCode
3528811
Title
A GMM supervector Kernel with the Bhattacharyya distance for SVM based speaker recognition
Author
You, Chang Huai ; Lee, Kong Aik ; Li, Haizhou
Author_Institution
Inst. for Infocomm Res. (I2R), A*STAR, Singapore
fYear
2009
fDate
19-24 April 2009
Firstpage
4221
Lastpage
4224
Abstract
Gaussian mixture model (GMM) supervector is one of the effective techniques in text independent speaker recognition. In our previous work, we introduce the GMM-UBM mean interval (GUMI) concept based on the Bhattacharyya distance. Subsequently GUMI kernel was successfully used in conjunction with support vector machine (SVM) for speaker recognition. Besides the first order statistics, it is generally believed that speaker cues are also partly conveyed by second order statistics. In this paper, we extend the Bhattacharyya-based SVM kernel by constructing the supervector with the mean statistical vector and the covariance statistical vector. Comparing with the Kullback-Leibler (KL) kernel, we demonstrate the effectiveness of the new kernel on the 2006 National Institute of Standards and Technology (NIST) speaker recognition evaluation (SRE) dataset.
Keywords
Gaussian processes; covariance analysis; speaker recognition; support vector machines; Bhattacharyya distance; GMM supervector kernel; GMM-UBM mean interval; GUMI kernel; Gaussian mixture model; Kullback-Leibler kernel; covariance statistical vector; mean statistical vector; support vector machine; text independent speaker recognition; Cost function; Distance measurement; Error analysis; Kernel; NIST; Speaker recognition; Speech; Statistics; Support vector machines; Testing; Gaussian Mixture Model; NIST Evaluation; Speaker Verification; Supervector; Support Vector Machine;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960560
Filename
4960560
Link To Document