DocumentCode :
1749668
Title :
Rapid speaker adaptation using a priori knowledge by eigenspace analysis of MLLR parameters
Author :
Wang, Nick J C ; Lee, Lin-Shun ; Seide, Frank ; Lin-Shan Lee
Author_Institution :
Philips Res. East Asia, Taipei, Taiwan
Volume :
1
fYear :
2001
fDate :
2001
Firstpage :
345
Abstract :
This paper considers the problem of rapid speaker adaptation in speech recognition. In particular, we exploit an approach based on combination of transformations, which utilizes the concepts of both maximum likelihood linear regression (MLLR) and eigenvoice adaptation. We analyze three different possible methods to realize the concept, and formulate a fast algorithm of maximum likelihood coefficient estimation for test speakers. It is found that the best approach can properly utilize the a priori knowledge of speaker-independent models in constructing the eigenspace for speaker characteristics, while using MLLR matrices in representing the specific speakers so as to reduce the on-line memory and computation requirement of the adaptation phase. This best approach leads to identical models relative to eigenvoice adaptation that is based on MLLR-adapted speaker models. The experimental results and discussions also provide a good analysis towards integration of the MLLR and eigenvoice approaches
Keywords :
eigenvalues and eigenfunctions; matrix algebra; maximum likelihood estimation; speech recognition; transforms; MLLR matrices; MLLR parameters; a priori knowledge; adaptation phase; eigenspace analysis; eigenvoice adaptation; maximum likelihood coefficient estimation; maximum likelihood linear regression; rapid speaker adaptation; speaker independent models; speech recognition; transformations; Adaptation model; Algorithm design and analysis; Covariance matrix; Eigenvalues and eigenfunctions; Hidden Markov models; Maximum likelihood estimation; Maximum likelihood linear regression; Principal component analysis; Speech recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
ISSN :
1520-6149
Print_ISBN :
0-7803-7041-4
Type :
conf
DOI :
10.1109/ICASSP.2001.940838
Filename :
940838
Link To Document :
بازگشت