• DocumentCode
    1277335
  • Title

    A study on speaker adaptation of the parameters of continuous density hidden Markov models

  • Author

    Lee, Chin-Hui ; Lin, Chih-Heng ; Juang, Biing-hwang

  • Author_Institution
    AT&T Bell Labs., Murray Hill, NJ, USA
  • Volume
    39
  • Issue
    4
  • fYear
    1991
  • fDate
    4/1/1991 12:00:00 AM
  • Firstpage
    806
  • Lastpage
    814
  • Abstract
    For a speech-recognition system based on continuous-density hidden Markov models (CDHMM), speaker adaptation of the parameters of CDHMM is formulated as a Bayesian learning procedure. A speaker adaptation procedure which is easily integrated into the segmental k-means training procedure for obtaining adaptive estimates of the CDHMM parameters is presented. Some results for adapting both the mean and the diagonal covariance matrix of the Gaussian state observation densities of a CDHMM are reported. The results from tests on a 39-word English alpha-digit vocabulary in isolated word mode indicate that the speaker adaptation procedure achieves the same level of performance as that of a speaker-independent system, when one training token from each word is used to perform speaker adaptation. It shows that much better performance is achieved when two or more training tokens are used for speaker adaptation. When compared with the speaker-dependent system, it is found that the performance of speaker adaptation is always equal to or better than that of speaker-dependent training using the same amount of training data
  • Keywords
    Markov processes; parameter estimation; speech recognition; 39-word English alpha-digit vocabulary; Bayesian learning procedure; CDHMM parameters; Gaussian state observation densities; adaptive estimates; continuous density hidden Markov models; diagonal covariance matrix; isolated word mode; performance; speaker adaptation; speech-recognition system; Bayesian methods; Covariance matrix; Hidden Markov models; Performance evaluation; Performance gain; Speech recognition; System testing; Training data; Transducers; Working environment noise;
  • fLanguage
    English
  • Journal_Title
    Signal Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1053-587X
  • Type

    jour

  • DOI
    10.1109/78.80902
  • Filename
    80902