Title :
A GMM supervector Kernel with the Bhattacharyya distance for SVM based speaker recognition
Author :
You, Chang Huai ; Lee, Kong Aik ; Li, Haizhou
Author_Institution :
Inst. for Infocomm Res. (I2R), A*STAR, Singapore
Abstract :
Gaussian mixture model (GMM) supervector is one of the effective techniques in text independent speaker recognition. In our previous work, we introduce the GMM-UBM mean interval (GUMI) concept based on the Bhattacharyya distance. Subsequently GUMI kernel was successfully used in conjunction with support vector machine (SVM) for speaker recognition. Besides the first order statistics, it is generally believed that speaker cues are also partly conveyed by second order statistics. In this paper, we extend the Bhattacharyya-based SVM kernel by constructing the supervector with the mean statistical vector and the covariance statistical vector. Comparing with the Kullback-Leibler (KL) kernel, we demonstrate the effectiveness of the new kernel on the 2006 National Institute of Standards and Technology (NIST) speaker recognition evaluation (SRE) dataset.
Keywords :
Gaussian processes; covariance analysis; speaker recognition; support vector machines; Bhattacharyya distance; GMM supervector kernel; GMM-UBM mean interval; GUMI kernel; Gaussian mixture model; Kullback-Leibler kernel; covariance statistical vector; mean statistical vector; support vector machine; text independent speaker recognition; Cost function; Distance measurement; Error analysis; Kernel; NIST; Speaker recognition; Speech; Statistics; Support vector machines; Testing; Gaussian Mixture Model; NIST Evaluation; Speaker Verification; Supervector; Support Vector Machine;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960560