DocumentCode :
918710
Title :
Estimating hidden Markov model parameters so as to maximize speech recognition accuracy
Author :
Bahl, Lalit R. ; Brown, Peter F. ; De Souza, Peter V. ; Mercer, Robert L.
Author_Institution :
IBM Thomas J. Watson Centre, Yorktown Heights, NY, USA
Volume :
1
Issue :
1
fYear :
1993
fDate :
1/1/1993 12:00:00 AM
Firstpage :
77
Lastpage :
83
Abstract :
The problem of estimating the parameter values of hidden Markov word models for speech recognition is addressed. It is argued that maximum-likelihood estimation of the parameters via the forward-backward algorithm may not lead to values which maximize recognition accuracy. An alternative estimation procedure called corrective training, which is aimed at minimizing the number of recognition errors, is described. Corrective training is similar to a well-known error-correcting training procedure for linear classifiers and works by iteratively adjusting the parameter values so as to make correct words more probable and incorrect words less probable. There are strong parallels between corrective training and maximum mutual information estimation; the relationship of these two techniques is discussed and a comparison is made of their performance. Although it has not been proved that the corrective training algorithm converges, experimental evidence suggests that it does, and that it leads to fewer recognition errors that can be obtained with conventional training methods
Keywords :
hidden Markov models; maximum likelihood estimation; parameter estimation; speech recognition; HMM; corrective training; error-correcting training; forward-backward algorithm; hidden Markov model parameters; hidden Markov word models; linear classifiers; maximum mutual information estimation; maximum-likelihood estimation; parameter estimation; recognition errors; speech recognition accuracy; Error correction; Hidden Markov models; Iterative algorithms; Maximum likelihood estimation; Mutual information; Probability distribution; Solids; Speech recognition; Speech synthesis; Topology;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.221369
Filename :
221369
Link To Document :
بازگشت