• DocumentCode
    1296189
  • Title

    Bilinear Model-Based Maximum Likelihood Linear Regression Speaker Adaptation Framework

  • Author

    Song, Hwa Jeon ; Kim, Hyung Soon

  • Author_Institution
    Res. Inst. of Comput. Inf. & Commun., Pusan Nat. Univ., Busan, South Korea
  • Volume
    16
  • Issue
    12
  • fYear
    2009
  • Firstpage
    1063
  • Lastpage
    1066
  • Abstract
    This letter proposes a novel framework for speaker adaptation, using bilinear model-based maximum likelihood linear regression (MLLR) method. First, a set of speaker models is decomposed into the style factor identified as each speaker´s characteristics and the common content factor across the speakers, by the bilinear model. Then, using some adaptation data from a new speaker, the speaker-specific model is generated by properly adjusting the dimensionality of the content factor and estimating a new style factor simultaneously. Experimental results show that the proposed framework outperforms MLLR with fewer number of parameters to be estimated.
  • Keywords
    maximum likelihood estimation; regression analysis; speech recognition; automatic speech recognition system; bilinear model; maximum likelihood linear regression method; speaker-specific model; Bilinear model; Maximum Likelihood Linear Regression (MLLR); speaker adaptation;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2009.2030030
  • Filename
    5200528