• DocumentCode
    921718
  • Title

    Discriminative training of dynamic programming based speech recognizers

  • Author

    Chang, Pao-Chung ; Juang, Biing-hwang

  • Author_Institution
    Telecommun. Labs., Minist. of Commun., Taiwan
  • Volume
    1
  • Issue
    2
  • fYear
    1993
  • fDate
    4/1/1993 12:00:00 AM
  • Firstpage
    135
  • Lastpage
    143
  • Abstract
    A new minimum recognition error formulation and a generalized probabilistic descent (GPD) algorithm are analyzed and used to accomplish discriminative training of a conventional dynamic-programming-based speech recognizer. The objective of discriminative training here is to directly minimize the recognition error rate. To achieve this, a formulation that allows controlled approximation of the exact error rate and renders optimization possible is used. The GPD method is implemented in a dynamic-time-warping (DTW)-based system. A linear discriminant function on the DTW distortion sequence is used to replace the conventional average DTW path distance. A series of speaker-independent recognition experiments using the highly confusible English E-set as the vocabulary showed a recognition rate of 84.4% compared to ~60% for traditional template training via clustering. The experimental results verified that the algorithm converges to a solution that achieves minimum error rate
  • Keywords
    dynamic programming; learning (artificial intelligence); speech recognition; DTW distortion sequence; discriminative training; dynamic programming; dynamic-time-warping; generalized probabilistic descent; highly confusible English E-set; linear discriminant function; minimum recognition error formulation; optimization; speaker-independent recognition; speech recognizer; Algorithm design and analysis; Artificial neural networks; Character recognition; Dynamic programming; Error analysis; Error correction; Probability distribution; Speech analysis; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.222873
  • Filename
    222873