• DocumentCode
    968164
  • Title

    A new hybrid algorithm for speech recognition based on HMM segmentation and learning vector quantization

  • Author

    Katagiri, Shigeru ; Lee, Chin-Hui

  • Author_Institution
    Speech Res. Dept., AT&T Bell Labs., Murray Hill, NJ, USA
  • Volume
    1
  • Issue
    4
  • fYear
    1993
  • fDate
    10/1/1993 12:00:00 AM
  • Firstpage
    421
  • Lastpage
    430
  • Abstract
    A hybrid speech recognition algorithm based on the combination of hidden Markov models (HMMs) and learning vector quantization (LVQ) is presented. The LVQ training algorithms are capable of producing highly discriminative reference vectors for classifying static patterns, i.e., vectors with a fixed dimension. The HMM formulation has also been successfully applied to the recognition of dynamic speech patterns that are of variable duration. It is shown that by combining both LVQ´s discriminative power and the HMM´s capability of modeling temporal variations of speech in a hybrid algorithm, the performance of the original HMM-based speech recognizer is significantly improved. For a highly confusable vocabulary consisting of the nine American English E-set letters used in a multispeaker, isolated-word test mode, the average word accuracy of the baseline HMM recognizer is 67%. When LVQ is incorporated in the hybrid system, the word accuracy increases to 83%
  • Keywords
    hidden Markov models; speech recognition; vector quantisation; American English E-set letters; HMM segmentation; LVQ; average word accuracy; discriminative reference vectors; dynamic speech patterns; hidden Markov models; highly confusable vocabulary; hybrid algorithm; isolated-word test mode; learning vector quantization; speech recognition; static patterns classification; temporal variations; training algorithms; variable duration patterns; Algorithm design and analysis; Artificial neural networks; Helium; Hidden Markov models; Neural networks; Pattern recognition; Speech recognition; Testing; Vector quantization; Vocabulary;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.242488
  • Filename
    242488