DocumentCode
2179896
Title
Improved speaker recognition when using i-vectors from multiple speech sources
Author
McLaren, Mitchell ; Van Leeuwen, David
Author_Institution
Centre for Language & Speech Technol., Radboud Univ., Nijmegen, Netherlands
fYear
2011
fDate
22-27 May 2011
Firstpage
5460
Lastpage
5463
Abstract
The concept of speaker recognition using i-vectors was recently introduced offering state-of-the-art performance. An i-vector is a compact representation of a speaker´s utterance after projection into a low-dimensional, total variability subspace trained using factor analysis. A secondary process involving linear discriminant analysis (LDA) is then used to improve the discrimination of i-vectors from different speakers. The newness of this technology invokes the question as to the best way to train the total variability subspace and LDA matrix when using speech collected from distinctly different sources. This paper presents a comparative study of a number of subspace training techniques and a novel source-normalised and-weighted LDA algorithm for the purpose of improving i-vector based speaker recognition under mis-matched evaluation conditions. Results from the NIST 2010 speaker recognition evaluation (SRE) suggest that accounting for source conditions in the LDA matrix as opposed to the total variability subspace training regime provides improved robustness to mis-matched evaluation conditions.
Keywords
matrix algebra; speaker recognition; LDA matrix; NIST 2010 speaker recognition evaluation; SRE; i-vectors; improved speaker recognition; linear discriminant analysis matrix; multiple speech sources; source-normalised-and-weighted LDA algorithm; Interviews; Microphones; NIST; Robustness; Speaker recognition; Speech; Training; i-vector; linear discriminant analysis; source conditions; speaker recognition; total variability;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947594
Filename
5947594
Link To Document