Bilinear Model-Based Maximum Likelihood Linear Regression Speaker Adaptation Framework

Author

Song, Hwa Jeon ; Kim, Hyung Soon

Author_Institution

Res. Inst. of Comput. Inf. & Commun., Pusan Nat. Univ., Busan, South Korea

Volume

Issue

fYear

2009

Firstpage

1063

Lastpage

1066

Abstract

This letter proposes a novel framework for speaker adaptation, using bilinear model-based maximum likelihood linear regression (MLLR) method. First, a set of speaker models is decomposed into the style factor identified as each speaker´s characteristics and the common content factor across the speakers, by the bilinear model. Then, using some adaptation data from a new speaker, the speaker-specific model is generated by properly adjusting the dimensionality of the content factor and estimating a new style factor simultaneously. Experimental results show that the proposed framework outperforms MLLR with fewer number of parameters to be estimated.

Keywords

maximum likelihood estimation; regression analysis; speech recognition; automatic speech recognition system; bilinear model; maximum likelihood linear regression method; speaker-specific model; Bilinear model; Maximum Likelihood Linear Regression (MLLR); speaker adaptation;

fLanguage

English

Journal_Title

Signal Processing Letters, IEEE

Publisher

ieee

ISSN

1070-9908

Type

jour

DOI

10.1109/LSP.2009.2030030

Filename

5200528

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=1296189