DocumentCode
3422280
Title
Boosted MMI for model and feature-space discriminative training
Author
Povey, Daniel ; Kanevsky, Dimitri ; Kingsbury, Brian ; Ramabhadran, Bhuvana ; Saon, George ; Visweswariah, Karthik
Author_Institution
TJ. Watson Res. Center, IBM, Yorktown Heights, NY
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
4057
Lastpage
4060
Abstract
We present a modified form of the maximum mutual information (MMI) objective function which gives improved results for discriminative training. The modification consists of boosting the likelihoods of paths in the denominator lattice that have a higher phone error relative to the correct transcript, by using the same phone accuracy function that is used in Minimum Phone Error (MPE) training. We combine this with another improvement to our implementation of the Extended Baum-Welch update equations for MMI, namely the canceling of any shared part of the numerator and denominator statistics on each frame (a procedure that is already done in MPE). This change affects the Gaussian-specific learning rate. We also investigate another modification whereby we replace I-smoothing to the ML estimate with I-smoothing to the previous iteration´s value. Boosted MMI gives better results than MPE in both model and feature-space discriminative training, although not consistently.
Keywords
Gaussian processes; feature extraction; speech recognition; Baum-Welch update equations; Gaussian-specific learning rate; denominator lattice; feature space discriminative training; maximum mutual information; minimum phone error; objective function; phone accuracy function; Boosting; Equations; Error correction; Gaussian processes; Hidden Markov models; Lattices; Maximum likelihood estimation; Mutual information; Speech recognition; Statistics; Discriminative Training; MMI; MPE; Maximum Margin; Speech Recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518545
Filename
4518545
Link To Document