DocumentCode
1691990
Title
Supervector Bayesian speaker comparison
Author
Borgstrom, Bengt J. ; McCree, Alan
Author_Institution
MIT Lincoln Lab., Lexington, MA, USA
fYear
2013
Firstpage
7693
Lastpage
7697
Abstract
In this paper we propose fully Bayesian speaker comparison of supervectors, which we refer to as SV-BSC, as a method for estimating whether a test cut was generated by the same speaker as an enrollment set. We derive the SV-BSC log-likelihood ratio of same-speaker to different-speaker hypotheses, and present solutions for model training and Bayesian scoring. We then show that if speaker and channel variability are assumed to inhabit a total variability subspace, SV-BSC scoring reduces to a form which requires only low-computation subspace operations. Finally, we show that common speaker recognition techniques such as Joint Factor Analysis (JFA) and i-vector Probabilistic Linear Discriminant Analysis (PLDA) are approximations to this full solution under certain additional assumptions. Experiments on the NIST 2010 SRE show SV-BSC to outperform a PLDA system.
Keywords
Bayes methods; probability; speaker recognition; statistical analysis; Bayesian scoring; JFA; NIST 2010 SRE; PLDA; SV-BSC log-likelihood ratio; channel variability; i-vector probabilistic linear discriminant analysis; joint factor analysis; low-computation subspace operations; model training; speaker hypotheses; speaker recognition techniques; supervector Bayesian speaker comparison; test cut; total variability subspace; Approximation methods; Bayes methods; Covariance matrices; NIST; Noise; Speaker recognition; Training; Bayesian speaker comparison; i-vector; speaker recognition; supervector; total variability;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6639160
Filename
6639160
Link To Document