• DocumentCode
    2705214
  • Title

    Large-Margin Minimum Classification Error Training for Large-Scale Speech Recognition Tasks

  • Author

    Dong Yu ; Li Deng ; Xiaodong He ; Acero, Alex

  • Author_Institution
    Microsoft Res., Redmond, WA, USA
  • Volume
    4
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    Recently, we have developed a novel discriminative training method named large-margin minimum classification error (LM-MCE) training that incorporates the idea of discriminative margin into the conventional minimum classification error (MCE) training method. In our previous work, this novel approach was formulated specifically for the MCE training using the sigmoid loss function and its effectiveness was demonstrated on the TIDIGITS task alone. In this paper two additional contributions are made. First, we formulate LM-MCE as a Bayes risk minimization problem whose loss function not only includes empirical error rates but also a margin-bound risk. This new formulation allows us to extend the same technique to a wide variety of MCE based training. Second, we have successfully applied LM-MCE training approach to the Microsoft internal large vocabulary telephony speech recognition task (with 2000 hours of training data and 120K of vocabulary) and achieved significant recognition accuracy improvement across-the-board. To our best knowledge, this is the first time that the large-margin approach is demonstrated to be successful in large-scale speech recognition tasks.
  • Keywords
    Bayes methods; speech processing; speech recognition; Bayes risk minimization; discriminative training method; large vocabulary telephony speech recognition task; large-margin minimum classification error training; large-scale speech recognition tasks; margin-bound risk; sigmoid loss function; Automatic speech recognition; Error analysis; Helium; Hidden Markov models; Large-scale systems; Risk management; Speech recognition; Telephony; Training data; Vocabulary; discriminative training; large-margin learning; minimum classification error training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.367275
  • Filename
    4218306