DocumentCode :
2701511
Title :
Evaluation of Proposed Modifications to MPE for Large Scale Discriminative Training
Author :
Povey, Daniel ; Kingsbury, Brian
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
Minimum phone error (MPE) is an objective function for discriminative training of acoustic models for speech recognition. Recently several different objective functions related to MPE have been proposed. In this paper we compare implementations of three of these to MPE on English and Arabic broadcast news. The techniques investigated are minimum phone frame error (MPFE), minimum divergence (MD), and a physical-state level version of minimum Bayes risk which we call s-MBR. In the case of MPFE we observe improvements over MPE. We propose that the smoothing constant used in MPE should be scaled according to the average value of the counts in the statistics obtained from these objective functions.
Keywords :
Bayes methods; speech recognition; Arabic broadcast news; English; MPE; acoustic models; large scale discriminative training; minimum Bayes risk; minimum divergence; minimum phone error; minimum phone frame error; smoothing constant; speech recognition; Broadcasting; Equations; Large-scale systems; Lattices; Maximum likelihood estimation; Smoothing methods; Speech recognition; Statistics; Training data; Vocabulary; Discriminative Training; Minimum Bayes Risk; Minimum Divergence; Minimum Phone Error; Minimum Phone Frame Error;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.366914
Filename :
4218102
Link To Document :
بازگشت