A study of on-line quasi-Bayes adaptation for CDHMM-based speech recognition

Author

Qiang Huo ; Lee, Chin-Hui

Author_Institution

ATR Interpreting Telephony Res. Labs., Kyoto, Japan

Volume

2

fYear

1996

fDate

7-10 May 1996

Firstpage

705

Abstract

We present a framework of quasi-Bayes (QB) learning of the parameters of the continuous density hidden Markov model (CDHMM) with Gaussian mixture state observation densities. Based on the theory of recursive Bayesian inference, the QB algorithm is designed to incrementally update the hyperparameters on the approximate posterior distribution and the CDHMM parameters simultaneously. By further introducing a simple forgetting mechanism to adjust the contribution of previously observed sample utterances, the algorithm is adaptive in nature and capable of performing an on-line adaptive learning using only the current sample utterance. It can thus be used to cope with the time-varying nature of some acoustic and environmental variabilities, including mismatches caused by changing speakers, channels, and transducers. As an example, the QB learning framework is applied to on-line speaker adaptation and its viability is confirmed in a series of comparative experiments using a 26-letter English alphabet vocabulary

Keywords

Bayes methods; Gaussian processes; hidden Markov models; inference mechanisms; learning systems; speech recognition; CDHMM parameters; English alphabet vocabulary; Gaussian mixture state observation densities; acoustic variabilities; adaptive algorithm; approximate posterior distribution; continuous density hidden Markov model; environmental variabilities; forgetting mechanism; hyperparameters; on-line adaptive learning; on-line speaker adaptation; quasi-Bayes learning; recursive Bayesian inference; sample utterance; speech recognition; time-varying nature; Acoustic transducers; Algorithm design and analysis; Bayesian methods; Hidden Markov models; History; Inference algorithms; Learning systems; Loudspeakers; Speech recognition; Vocabulary;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on

Conference_Location

Atlanta, GA

ISSN

1520-6149

Print_ISBN

0-7803-3192-3

Type

conf

DOI

10.1109/ICASSP.1996.543218

Filename

543218