DocumentCode
3163520
Title
Speaker recognition with region-constrained MLLR transforms
Author
Stolcke, Andreas ; Mandal, Arindam ; Shriberg, Elizabeth
Author_Institution
Microsoft Speech Labs., Mountain View, CA, USA
fYear
2012
fDate
25-30 March 2012
Firstpage
4397
Lastpage
4400
Abstract
It has been shown that standard cepstral speaker recognition models can be enhanced by region-constrained models, where features are extracted only from certain speech regions defined by linguistic or prosodic criteria. Such region-constrained models can capture features that are more stable, highly idiosyncratic, or simply complementary to the baseline system. In this paper we ask if another major class of speaker recognition models, those based on MLLR speaker adaptation transforms, can also benefit from region-constrained feature extraction. In our approach, we define regions based on phonetic and prosodic criteria, based on automatic speech recognition output, and perform MLLR estimation using only frames selected by these criteria. The resulting transform features are appended to those of a state-of-the-art MLLR speaker recognition system and jointly modeled by SVMs. Multiple regions can be added in this fashion. We find consistent gains over the baseline system in the SRE2010 speaker verification task.
Keywords
feature extraction; maximum likelihood estimation; regression analysis; speaker recognition; support vector machines; SRE2010 speaker verification; automatic speech recognition; maximum likelihood linear regression; phonetic criteria; prosodic criteria; region constrained MLLR transform; region constrained feature extraction; speaker adaptation transform; speaker recognition model; support vector machine; Adaptation models; Cepstral analysis; Feature extraction; Speaker recognition; Speech; Speech recognition; Transforms; MLLR-SVM; Speaker recognition; region-constrained speaker modeling;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6288894
Filename
6288894
Link To Document