DocumentCode :
3485069
Title :
Speaker adaptation with an Exponential Transform
Author :
Povey, Daniel ; Zweig, Geoffrey ; Acero, Alex
Author_Institution :
Microsoft Res., Microsoft, Redmond, WA, USA
fYear :
2011
fDate :
11-15 Dec. 2011
Firstpage :
158
Lastpage :
163
Abstract :
In this paper we describe a linear transform that we call an Exponential Transform (ET), which integrates aspects of CMLLR, VTLN and STC/MLLT into a single transform with jointly trained components. Its main advantage is that a very small number of speaker-specific parameters is required, thus enabling effective adaptation with small amounts of speaker specific data. Our formulation shares some characteristics of Vocal Tract Length Normalization (VTLN), and is intended as a substitute for VTLN. The key part of the transform is controlled by a single speaker-specific parameter that is analogous to a VTLN warp factor. The transform has non-speaker-specific parameters that are learned from data, and we find that the axis along which male and female speakers differ is automatically learned. The exponential transform has no explicit notion of frequency warping, which makes it applicable in principle to non-standard features such as those derived from neural nets, or when the key axes may not be male-female. Based on our experiments with standard MFCC features, it appears to perform better than conventional VTLN.
Keywords :
speaker recognition; transforms; CMLLR; ET; STC-MLLT; conventional VTLN; exponential transform; female speakers; frequency warping; linear transform; male speakers; neural nets; speaker-specific parameter; standard MFCC; vocal tract length normalization; Adaptation models; Computational modeling; Hidden Markov models; Jacobian matrices; Training; Transforms; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location :
Waikoloa, HI
Print_ISBN :
978-1-4673-0365-1
Electronic_ISBN :
978-1-4673-0366-8
Type :
conf
DOI :
10.1109/ASRU.2011.6163923
Filename :
6163923
Link To Document :
بازگشت