DocumentCode :
1296189
Title :
Bilinear Model-Based Maximum Likelihood Linear Regression Speaker Adaptation Framework
Author :
Song, Hwa Jeon ; Kim, Hyung Soon
Author_Institution :
Res. Inst. of Comput. Inf. & Commun., Pusan Nat. Univ., Busan, South Korea
Volume :
16
Issue :
12
fYear :
2009
Firstpage :
1063
Lastpage :
1066
Abstract :
This letter proposes a novel framework for speaker adaptation, using bilinear model-based maximum likelihood linear regression (MLLR) method. First, a set of speaker models is decomposed into the style factor identified as each speaker´s characteristics and the common content factor across the speakers, by the bilinear model. Then, using some adaptation data from a new speaker, the speaker-specific model is generated by properly adjusting the dimensionality of the content factor and estimating a new style factor simultaneously. Experimental results show that the proposed framework outperforms MLLR with fewer number of parameters to be estimated.
Keywords :
maximum likelihood estimation; regression analysis; speech recognition; automatic speech recognition system; bilinear model; maximum likelihood linear regression method; speaker-specific model; Bilinear model; Maximum Likelihood Linear Regression (MLLR); speaker adaptation;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2009.2030030
Filename :
5200528
Link To Document :
بازگشت