DocumentCode
1296189
Title
Bilinear Model-Based Maximum Likelihood Linear Regression Speaker Adaptation Framework
Author
Song, Hwa Jeon ; Kim, Hyung Soon
Author_Institution
Res. Inst. of Comput. Inf. & Commun., Pusan Nat. Univ., Busan, South Korea
Volume
16
Issue
12
fYear
2009
Firstpage
1063
Lastpage
1066
Abstract
This letter proposes a novel framework for speaker adaptation, using bilinear model-based maximum likelihood linear regression (MLLR) method. First, a set of speaker models is decomposed into the style factor identified as each speaker´s characteristics and the common content factor across the speakers, by the bilinear model. Then, using some adaptation data from a new speaker, the speaker-specific model is generated by properly adjusting the dimensionality of the content factor and estimating a new style factor simultaneously. Experimental results show that the proposed framework outperforms MLLR with fewer number of parameters to be estimated.
Keywords
maximum likelihood estimation; regression analysis; speech recognition; automatic speech recognition system; bilinear model; maximum likelihood linear regression method; speaker-specific model; Bilinear model; Maximum Likelihood Linear Regression (MLLR); speaker adaptation;
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
ieee
ISSN
1070-9908
Type
jour
DOI
10.1109/LSP.2009.2030030
Filename
5200528
Link To Document