DocumentCode
2705214
Title
Large-Margin Minimum Classification Error Training for Large-Scale Speech Recognition Tasks
Author
Dong Yu ; Li Deng ; Xiaodong He ; Acero, Alex
Author_Institution
Microsoft Res., Redmond, WA, USA
Volume
4
fYear
2007
fDate
15-20 April 2007
Abstract
Recently, we have developed a novel discriminative training method named large-margin minimum classification error (LM-MCE) training that incorporates the idea of discriminative margin into the conventional minimum classification error (MCE) training method. In our previous work, this novel approach was formulated specifically for the MCE training using the sigmoid loss function and its effectiveness was demonstrated on the TIDIGITS task alone. In this paper two additional contributions are made. First, we formulate LM-MCE as a Bayes risk minimization problem whose loss function not only includes empirical error rates but also a margin-bound risk. This new formulation allows us to extend the same technique to a wide variety of MCE based training. Second, we have successfully applied LM-MCE training approach to the Microsoft internal large vocabulary telephony speech recognition task (with 2000 hours of training data and 120K of vocabulary) and achieved significant recognition accuracy improvement across-the-board. To our best knowledge, this is the first time that the large-margin approach is demonstrated to be successful in large-scale speech recognition tasks.
Keywords
Bayes methods; speech processing; speech recognition; Bayes risk minimization; discriminative training method; large vocabulary telephony speech recognition task; large-margin minimum classification error training; large-scale speech recognition tasks; margin-bound risk; sigmoid loss function; Automatic speech recognition; Error analysis; Helium; Hidden Markov models; Large-scale systems; Risk management; Speech recognition; Telephony; Training data; Vocabulary; discriminative training; large-margin learning; minimum classification error training;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Type
conf
DOI
10.1109/ICASSP.2007.367275
Filename
4218306
Link To Document