• DocumentCode
    2179128
  • Title

    Discriminative Training for direct minimization of deletion, insertion and substitution errors

  • Author

    Shin, Sunghwan ; Jung, Ho-Young ; Juang, Biing-Hwang

  • Author_Institution
    Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5328
  • Lastpage
    5331
  • Abstract
    In this paper, we follow the minimum error principle for acoustic modeling and formulate error objectives in insertion, deletion, and substitution separately for minimization during training. This new training paradigm generalized from the MVE criterion can explain the direct relationship between recognition errors and detection errors by re-interpreting deletion, insertion, and substitution errors as miss, false alarm, and miss/false-alarm errors happening together. Under the MVE criterion, by applying two mis-verification measures for miss and false alarm errors selectively along with the types of recognition error definition, we developed three individual objective training criteria, minimum deletion error (MDE), minimum insertion error (MIE), and minimum substitution error (MSE), of which each objective function can directly minimize each of the three types of the recognition errors. In the TIMIT phone recognition task, the experimental results confirm that each objective criterion of MDE, MIE, and MSE results in primarily minimizing its target error type, respectively. Furthermore, a simple combination of the individual objective criteria outperforms the conventional string-based MCE in the overall recognition error rate.
  • Keywords
    speech recognition; MDE; MIE; MSE; MVE criterion; acoustic modeling; continuous speech recognition; discriminative training; minimum deletion error; minimum insertion error; minimum substitution errors; Hidden Markov models; Measurement uncertainty; Minimization; Speech; Speech recognition; Training; Training data; continuous speech recognition; discriminative training; minimum verification error;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947561
  • Filename
    5947561