Title :
Evaluation of Proposed Modifications to MPE for Large Scale Discriminative Training
Author :
Povey, Daniel ; Kingsbury, Brian
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
Minimum phone error (MPE) is an objective function for discriminative training of acoustic models for speech recognition. Recently several different objective functions related to MPE have been proposed. In this paper we compare implementations of three of these to MPE on English and Arabic broadcast news. The techniques investigated are minimum phone frame error (MPFE), minimum divergence (MD), and a physical-state level version of minimum Bayes risk which we call s-MBR. In the case of MPFE we observe improvements over MPE. We propose that the smoothing constant used in MPE should be scaled according to the average value of the counts in the statistics obtained from these objective functions.
Keywords :
Bayes methods; speech recognition; Arabic broadcast news; English; MPE; acoustic models; large scale discriminative training; minimum Bayes risk; minimum divergence; minimum phone error; minimum phone frame error; smoothing constant; speech recognition; Broadcasting; Equations; Large-scale systems; Lattices; Maximum likelihood estimation; Smoothing methods; Speech recognition; Statistics; Training data; Vocabulary; Discriminative Training; Minimum Bayes Risk; Minimum Divergence; Minimum Phone Error; Minimum Phone Frame Error;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.366914