Title :
A basis method for robust estimation of constrained MLLR
Author :
Povey, Daniel ; Yao, Kaisheng
Author_Institution :
Microsoft, Redmond, WA, USA
Abstract :
Constrained Maximum Likelihood Linear Regression (CMLLR) is a widely used speaker adaptation technique in which an affine transform of the features is estimated for each speaker. However, when the amount of speech data available is very small (e.g. a few seconds), it can be difficult to get sufficiently accurate estimates of the transform parameters. In this paper we describe a method of estimating CMLLR robustly from less data. We do this by representing the CMLLR transform matrix as a weighted sum over basis matrices, where the basis is constructed in such a way that the most important variation is concentrated in the leading coefficients. Depending on the amount of data available, we can choose to estimate a smaller or larger number of coefficients.
Keywords :
affine transforms; matrix algebra; maximum likelihood estimation; regression analysis; speaker recognition; CMLLR transform matrix; affine transform; basis matrices; constrained maximum likelihood linear regression; robust estimation; speaker adaptation technique; weighted sum; Adaptation models; Covariance matrix; Estimation; Hidden Markov models; Robustness; Speech; Transforms; MLLR; Speaker Adaptation; Speech Recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947344