Title :
Improved discriminative training techniques for large vocabulary continuous speech recognition
Author :
Povey, D. ; Woodland, P.C.
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Abstract :
Investigates the use of discriminative training techniques for large vocabulary speech recognition with training datasets up to 265 hours. Techniques for improving lattice-based maximum mutual information estimation (MMIE) training are described and compared to frame discrimination (FD). An objective function which is an interpolation of MMIE and standard maximum likelihood estimation (MLE) is also discussed. Experimental results on both the Switchboard and North American Business News tasks show that MMIE training can yield significant performance improvements over standard MLE even for the most complex speech recognition problems with very large training sets
Keywords :
hidden Markov models; interpolation; maximum likelihood estimation; optimisation; probability; speech recognition; North American Business News; Switchboard; discriminative training techniques; frame discrimination; interpolation; large vocabulary continuous speech recognition; lattice-based maximum mutual information estimation training; maximum likelihood estimation; Equations; Hidden Markov models; Interpolation; Maximum likelihood estimation; Mutual information; Optimization methods; Parameter estimation; Speech recognition; Testing; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.940763