Title :
Likelihood-ratio empirical kernels for i-vector based PLDA-SVM scoring
Author :
Man-Wai Mak ; Wei Rao
Author_Institution :
Dept. of Electron. & Inf. Eng., Hong Kong Polytech. Univ., Hong Kong, China
Abstract :
Likelihood ratio (LR) scoring in PLDA speaker verification systems only uses the information of background speakers implicitly. This paper exploits the notion of empirical kernel maps to incorporate background speaker information into the scoring process explicitly. This is achieved by training a scoring SVM for each target speaker based on a kernel in the empirical feature space. More specially, given a test i-vector and the identity of the target under test, a score vector is constructed by computing the LR scores of the test i-vector with respect to the target-speaker´s i-vectors and a set of background-speakers´ i-vectors. While in most situations, only one target-speaker i-vector is available for training the SVM, this paper demonstrates that if the enrollment utterance is sufficiently long, a number of target-speaker i-vectors can be generated by an utterance partitioning and resampling technique, resulting in much better scoring SVMs. Results on NIST 2010 SRE suggests that the idea of incorporating background speaker information into PLDA scoring through training speaker-dependent SVMs together with the utterance partitioning techniques can boost the performance of i-vector based PLDA systems significantly.
Keywords :
probability; signal sampling; speaker recognition; support vector machines; I-vector based PLDA-SVM scoring; LR scoring; PLDA speaker verification systems; background speaker information; empirical feature space; empirical kernel maps; enrollment utterance; likelihood ratio scoring; likelihood-ratio empirical kernels; probabilistic linear discriminant analysis; score vector; scoring process; target identity; target-speaker I-vector; test I-vector; utterance partitioning; utterance resampling; Acoustics; Kernel; NIST; Speech; Support vector machines; Training; Vectors; I-vectors; empirical kernel maps; likelihood ratio kernels; probabilistic linear discriminant analysis;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639162