Title :
Speaker Recognition With Session Variability Normalization Based on MLLR Adaptation Transforms
Author :
Stolcke, Andreas ; Kajarekar, Sachin S. ; Ferrer, Luciana ; Shrinberg, Elizabeth
Author_Institution :
SRI Int., Menlo Park
Abstract :
We present a new modeling approach for speaker recognition that uses the maximum-likelihood linear regression (MLLR) adaptation transforms employed by a speech recognition system as features for support vector machine (SVM) speaker models. This approach is attractive because, unlike standard frame-based cepstral speaker recognition models, it normalizes for the choice of spoken words in text-independent speaker verification without data fragmentation. We discuss the basics of the MLLR-SVM approach, and show how it can be enhanced by combining transforms relative to multiple reference models, with excellent results on recent English NIST evaluation sets. We then show how the approach can be applied even if no full word-level recognition system is available, which allows its use on non-English data even without matching speech recognizers. Finally, we examine how two recently proposed algorithms for intersession variability compensation perform in conjunction with MLLR-SVM.
Keywords :
maximum likelihood detection; speaker recognition; support vector machines; MLLR adaptation transforms; maximum-likelihood linear regression; session variability normalization; speaker recognition; speech recognition; support vector machine; Cepstral analysis; Feature extraction; Helium; Laboratories; Linear regression; Maximum likelihood linear regression; NIST; Speaker recognition; Speech recognition; Support vector machines; Intersession variability compensation; maximum-likelihood linear regression–support vector machine (MLLR–SVM); speaker recognition;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2007.902859