DocumentCode :
1239827
Title :
Rapid online adaptation based on transformation space model evolution
Author :
Kim, Dong Kook ; Kim, Nam Soo
Author_Institution :
Sch. of Electr. Eng. & INMC, Seoul Nat. Univ., South Korea
Volume :
13
Issue :
2
fYear :
2005
fDate :
3/1/2005 12:00:00 AM
Firstpage :
194
Lastpage :
202
Abstract :
This paper presents a new approach to online linear regression adaptation of continuous density hidden Markov models based on transformation space model (TSM) evolution. The TSM which characterizes the a priori knowledge of the training speakers associated with maximum likelihood linear regression matrix parameters is effectively described in terms of the latent variable models such as the factor analysis or probabilistic principal component analysis. The TSM provides various sources of information such as the correlation information, the prior distribution, and the prior knowledge of the regression parameters that are very useful for rapid adaptation. The quasi-Bayes estimation algorithm is formulated to incrementally update the hyperparameters of the TSM and regression matrices simultaneously. The proposed TSM evolution is a general framework with batch TSM adaptation as a special case. Experiments on supervised speaker adaptation demonstrate that the proposed approach is more effective compared with the conventional quasi-Bayes linear regression technique when a small amount of adaptation data is available.
Keywords :
Bayes methods; hidden Markov models; matrix algebra; maximum likelihood estimation; principal component analysis; regression analysis; speech recognition; a priori knowledge; continuous density hidden Markov model; correlation information; maximum likelihood linear regression matrix parameter; online linear regression adaptation; probabilistic principal component analysis; quasiBayes estimation algorithm; transformation space model evolution; Adaptation model; Automatic speech recognition; Automatic testing; Degradation; Hidden Markov models; Information resources; Linear regression; Loudspeakers; Maximum likelihood linear regression; Principal component analysis; Factor analysis; online adaptation; prior evolution; probabilistic principal component analysis; quasi-Bayes estimate; rapid speaker adaptation; transformation space model;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2004.841427
Filename :
1395964
Link To Document :
بازگشت