DocumentCode
2179128
Title
Discriminative Training for direct minimization of deletion, insertion and substitution errors
Author
Shin, Sunghwan ; Jung, Ho-Young ; Juang, Biing-Hwang
Author_Institution
Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
fYear
2011
fDate
22-27 May 2011
Firstpage
5328
Lastpage
5331
Abstract
In this paper, we follow the minimum error principle for acoustic modeling and formulate error objectives in insertion, deletion, and substitution separately for minimization during training. This new training paradigm generalized from the MVE criterion can explain the direct relationship between recognition errors and detection errors by re-interpreting deletion, insertion, and substitution errors as miss, false alarm, and miss/false-alarm errors happening together. Under the MVE criterion, by applying two mis-verification measures for miss and false alarm errors selectively along with the types of recognition error definition, we developed three individual objective training criteria, minimum deletion error (MDE), minimum insertion error (MIE), and minimum substitution error (MSE), of which each objective function can directly minimize each of the three types of the recognition errors. In the TIMIT phone recognition task, the experimental results confirm that each objective criterion of MDE, MIE, and MSE results in primarily minimizing its target error type, respectively. Furthermore, a simple combination of the individual objective criteria outperforms the conventional string-based MCE in the overall recognition error rate.
Keywords
speech recognition; MDE; MIE; MSE; MVE criterion; acoustic modeling; continuous speech recognition; discriminative training; minimum deletion error; minimum insertion error; minimum substitution errors; Hidden Markov models; Measurement uncertainty; Minimization; Speech; Speech recognition; Training; Training data; continuous speech recognition; discriminative training; minimum verification error;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947561
Filename
5947561
Link To Document