Title :
Discriminative training for Bayesian sensing hidden Markov models
Author :
Saon, George ; Chien, Jen-Tzung
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
We describe feature space and model space discriminative training for a new class of acoustic models called Bayesian sensing hidden Markov models (BS-HMMs). In BS-HMMs, speech data is represented by a set of state-dependent basis vectors. The relevance of a feature vector to different bases is determined by the precision matrices of the sensing weights. The basis vectors and the precision matrices of the reconstruction errors are jointly estimated by optimizing a maximum mutual information (MMI) criterion. Additionally, we discuss the training of an fMPE-style discriminative feature transformation under the same criterion given these models. Experimental results on an LVCSR task show that the proposed models outperform discriminatively trained conventional HMMs with Gaussian mixture models (GMMs). Cross-adapting the baseline GMM-HMMs to the BS-HMM output yields a 6% relative gain which indicates that the two systems make different errors.
Keywords :
belief networks; hidden Markov models; matrix algebra; speech processing; BS-HMM; Bayesian sensing hidden Markov models; LVCSR; MMI criterion; discriminative training; fMPE-style discriminative feature transformation; matrices; maximum mutual information criterion; speech data; state-dependent basis vectors; Acoustics; Bayesian methods; Hidden Markov models; Sensors; Smoothing methods; Training; Transforms; Bayesian learning; basis representation; discriminative training;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947558