DocumentCode :
1749661
Title :
Eigenspace-based maximum a posteriori linear regression for rapid speaker adaptation
Author :
Chen, Kuan-ting ; Wang, Hsin-Min
Author_Institution :
Inst. of Inf. Sci., Acad. Sinica, Taipei, Taiwan
Volume :
1
fYear :
2001
fDate :
2001
Firstpage :
317
Abstract :
We present an eigenspace-based approach toward prior density selection for the MAPLR framework. The proposed eigenspace-based MAPLR approach was developed by introducing a priori knowledge analysis on the training speakers via probabilistic principal component analysis (PPCA), so as to construct an eigenspace for speaker-specific full regression matrices as well as to derive a set of bases called eigen-matrices. The priors of MAPLR transformations for each outside speaker are then chosen in the space spanned by the first K eigen-matrices. By incorporating the PPCA model into the MAPLR scheme, the number of free parameters in choosing the priors can be effectively reduced, while the underlying structure of the acoustic space as well as the precise modeling of the inter-dimensional correlation among the model parameters can be well preserved. Both supervised and unsupervised adaptation experiments showed that the proposed approach significantly outperformed the conventional maximum likelihood linear regression (MLLR) approach using either diagonal or full regression matrices
Keywords :
Hessian matrices; adaptive systems; correlation methods; eigenvalues and eigenfunctions; principal component analysis; probability; speech processing; statistical analysis; unsupervised learning; PPCA model; acoustic space; continuous Mandarin Chinese telephone speech database; density selection; eigen-matrices; eigenspace-based MAP linear regression; eigenspace-based MAPLR; eigenspace-based maximum a posteriori linear regression; full regression matrices; inter-dimensional correlation; model parameters; probabilistic principal component analysis; speaker adaptation; supervised adaptation experiments; unsupervised adaptation experiments; Linear regression; Loudspeakers; Maximum likelihood linear regression; Parameter estimation; Principal component analysis; Regression tree analysis; Robustness; Sparse matrices; Speech recognition; System testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
ISSN :
1520-6149
Print_ISBN :
0-7803-7041-4
Type :
conf
DOI :
10.1109/ICASSP.2001.940831
Filename :
940831
Link To Document :
بازگشت