• DocumentCode
    3162518
  • Title

    Decoding network optimization using minimum transition error training

  • Author

    Kubo, Yotaro ; Watanabe, Shinji ; Nakamura, Atsushi

  • Author_Institution
    NTT Commun. Sci. Labs., Kyoto, Japan
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4197
  • Lastpage
    4200
  • Abstract
    The discriminative optimization of decoding networks is important for minimizing speech recognition error. Recently, several methods have been reported that optimize decoding networks by extending weighted finite state transducer (WFST)-based decoding processes to a linear classification process. In this paper, we model decoding processes by using conditional random fields (CRFs). Since the maximum mutual information (MMI) training technique is straightforwardly applicable for CRF training, several sophisticated training methods proposed as the variants of MMI can be incorporated in our decoding network optimization. This paper adapts the boosted MMI and the differenced MMI methods for decoding network optimization so that state transition errors are minimized in WFST decoding. We evaluated the proposed methods by conducting large-vocabulary continuous speech recognition experiments. We confirmed that the CRF-based framework and transition error minimization are efficient for improving the accuracy of automatic speech recognizers.
  • Keywords
    decoding; network coding; optimisation; speech coding; speech recognition; CRF training; CRF-based framework; MMI training technique; WFST-based decoding processes; automatic speech recognizers; conditional random fields; discriminative optimization; large-vocabulary continuous speech recognition experiments; linear classification process; maximum mutual information training technique; minimum transition error training; optimize decoding networks; sophisticated training methods; speech recognition error; state transition errors; transition error minimization; weighted finite state transducer-based decoding processes; Acoustics; Decoding; Hidden Markov models; Optimization; Speech recognition; Training; Vectors; Automatic speech recognition; conditional random fields; transition errors; weighed finite-state transducers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288844
  • Filename
    6288844