• DocumentCode
    1856761
  • Title

    A study of on-line quasi-Bayes adaptation for CDHMM-based speech recognition

  • Author

    Qiang Huo ; Lee, Chin-Hui

  • Author_Institution
    ATR Interpreting Telephony Res. Labs., Kyoto, Japan
  • Volume
    2
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    705
  • Abstract
    We present a framework of quasi-Bayes (QB) learning of the parameters of the continuous density hidden Markov model (CDHMM) with Gaussian mixture state observation densities. Based on the theory of recursive Bayesian inference, the QB algorithm is designed to incrementally update the hyperparameters on the approximate posterior distribution and the CDHMM parameters simultaneously. By further introducing a simple forgetting mechanism to adjust the contribution of previously observed sample utterances, the algorithm is adaptive in nature and capable of performing an on-line adaptive learning using only the current sample utterance. It can thus be used to cope with the time-varying nature of some acoustic and environmental variabilities, including mismatches caused by changing speakers, channels, and transducers. As an example, the QB learning framework is applied to on-line speaker adaptation and its viability is confirmed in a series of comparative experiments using a 26-letter English alphabet vocabulary
  • Keywords
    Bayes methods; Gaussian processes; hidden Markov models; inference mechanisms; learning systems; speech recognition; CDHMM parameters; English alphabet vocabulary; Gaussian mixture state observation densities; acoustic variabilities; adaptive algorithm; approximate posterior distribution; continuous density hidden Markov model; environmental variabilities; forgetting mechanism; hyperparameters; on-line adaptive learning; on-line speaker adaptation; quasi-Bayes learning; recursive Bayesian inference; sample utterance; speech recognition; time-varying nature; Acoustic transducers; Algorithm design and analysis; Bayesian methods; Hidden Markov models; History; Inference algorithms; Learning systems; Loudspeakers; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.543218
  • Filename
    543218