DocumentCode :
1691990
Title :
Supervector Bayesian speaker comparison
Author :
Borgstrom, Bengt J. ; McCree, Alan
Author_Institution :
MIT Lincoln Lab., Lexington, MA, USA
fYear :
2013
Firstpage :
7693
Lastpage :
7697
Abstract :
In this paper we propose fully Bayesian speaker comparison of supervectors, which we refer to as SV-BSC, as a method for estimating whether a test cut was generated by the same speaker as an enrollment set. We derive the SV-BSC log-likelihood ratio of same-speaker to different-speaker hypotheses, and present solutions for model training and Bayesian scoring. We then show that if speaker and channel variability are assumed to inhabit a total variability subspace, SV-BSC scoring reduces to a form which requires only low-computation subspace operations. Finally, we show that common speaker recognition techniques such as Joint Factor Analysis (JFA) and i-vector Probabilistic Linear Discriminant Analysis (PLDA) are approximations to this full solution under certain additional assumptions. Experiments on the NIST 2010 SRE show SV-BSC to outperform a PLDA system.
Keywords :
Bayes methods; probability; speaker recognition; statistical analysis; Bayesian scoring; JFA; NIST 2010 SRE; PLDA; SV-BSC log-likelihood ratio; channel variability; i-vector probabilistic linear discriminant analysis; joint factor analysis; low-computation subspace operations; model training; speaker hypotheses; speaker recognition techniques; supervector Bayesian speaker comparison; test cut; total variability subspace; Approximation methods; Bayes methods; Covariance matrices; NIST; Noise; Speaker recognition; Training; Bayesian speaker comparison; i-vector; speaker recognition; supervector; total variability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639160
Filename :
6639160
Link To Document :
بازگشت