Title :
Bayesian sensing hidden Markov models for speech recognition
Author :
Saon, George ; Chien, Jen-Tzung
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
We introduce Bayesian sensing hidden Markov models (BS-HMMs) to represent speech data based on a set of state-dependent basis vectors. By incorporating the prior density of sensing weights, the relevance of a feature vector to different bases is determined by the corresponding precision parameters. The BS-HMM parameters, consisting of the basis vectors, the precision matrices of sensing weights and the precision matrices of reconstruction errors, are jointly estimated by maximizing the likelihood function, which is marginalized over the weight priors. We derive recursive solutions for the three parameters, which are expressed via maximum a posteriori estimates of the sensing weights. Experimental results on an LVCSR task show consistent gains over conventional HMMs with Gaussian mixture models for both ML and discriminative training scenarios.
Keywords :
Bayes methods; Gaussian processes; hidden Markov models; maximum likelihood estimation; recursive estimation; speech recognition; BS-HMM parameters; Bayesian sensing hidden Markov models; Gaussian mixture models; LVCSR; maximum a posteriori estimation; precision matrices; recursive solution; speech recognition; Adaptation models; Bayesian methods; Hidden Markov models; Sensors; Speech; Speech recognition; Training; Bayesian learning; Speech recognition; acoustic model; basis representation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947493