Title :
A new hybrid algorithm for speech recognition based on HMM segmentation and learning vector quantization
Author :
Katagiri, Shigeru ; Lee, Chin-Hui
Author_Institution :
Speech Res. Dept., AT&T Bell Labs., Murray Hill, NJ, USA
fDate :
10/1/1993 12:00:00 AM
Abstract :
A hybrid speech recognition algorithm based on the combination of hidden Markov models (HMMs) and learning vector quantization (LVQ) is presented. The LVQ training algorithms are capable of producing highly discriminative reference vectors for classifying static patterns, i.e., vectors with a fixed dimension. The HMM formulation has also been successfully applied to the recognition of dynamic speech patterns that are of variable duration. It is shown that by combining both LVQ´s discriminative power and the HMM´s capability of modeling temporal variations of speech in a hybrid algorithm, the performance of the original HMM-based speech recognizer is significantly improved. For a highly confusable vocabulary consisting of the nine American English E-set letters used in a multispeaker, isolated-word test mode, the average word accuracy of the baseline HMM recognizer is 67%. When LVQ is incorporated in the hybrid system, the word accuracy increases to 83%
Keywords :
hidden Markov models; speech recognition; vector quantisation; American English E-set letters; HMM segmentation; LVQ; average word accuracy; discriminative reference vectors; dynamic speech patterns; hidden Markov models; highly confusable vocabulary; hybrid algorithm; isolated-word test mode; learning vector quantization; speech recognition; static patterns classification; temporal variations; training algorithms; variable duration patterns; Algorithm design and analysis; Artificial neural networks; Helium; Hidden Markov models; Neural networks; Pattern recognition; Speech recognition; Testing; Vector quantization; Vocabulary;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on