DocumentCode
977734
Title
Bayesian adaptive learning of the parameters of hidden Markov model for speech recognition
Author
Huo, Qiang ; Chan, Chorkin ; Lee, Chin-Hui
Author_Institution
Dept. of Comput. Sci., Hong Kong Univ., Hong Kong
Volume
3
Issue
5
fYear
1995
fDate
9/1/1995 12:00:00 AM
Firstpage
334
Lastpage
345
Abstract
A theoretical framework for Bayesian adaptive training of the parameters of a discrete hidden Markov model (DHMM) and of a semi-continuous HMM (SCHMM) with Gaussian mixture state observation densities is presented. In addition to formulating the forward-backward MAP (maximum a posteriori) and the segmental MAP algorithms for estimating the above HMM parameters, a computationally efficient segmental quasi-Bayes algorithm for estimating the state-specific mixture coefficients in SCHMM is developed. For estimating the parameters of the prior densities, a new empirical Bayes method based on the moment estimates is also proposed. The MAP algorithms and the prior parameter specification are directly applicable to training speaker adaptive HMMs. Practical issues related to the use of the proposed techniques for HMM-based speaker adaptation are studied. The proposed MAP algorithms are shown to be effective especially in the cases in which the training or adaptation data are limited
Keywords
Bayes methods; Gaussian processes; adaptive estimation; hidden Markov models; learning (artificial intelligence); maximum likelihood estimation; speech recognition; Bayesian adaptive learning; Gaussian mixture state observation densities; discrete hidden Markov model; forward-backward MAP; hidden Markov model; maximum a posteriori; moment estimates; prior densities; prior parameter specification; segmental MAP algorithm; segmental quasi-Bayes algorithm; semi-continuous HMM; speaker adaptive HMM; speech recognition; state-specific mixture coefficients; Bayesian methods; Computer science; Helium; Hidden Markov models; Inference algorithms; Maximum likelihood estimation; Parameter estimation; Prototypes; Speech recognition; State estimation;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/89.466661
Filename
466661
Link To Document