مرکز منطقه ای اطلاع رساني علوم و فناوري - A new hybrid algorithm for speech recognition based on HMM segmentation and learning vector quantization

DocumentCode :

968164

Title :

A new hybrid algorithm for speech recognition based on HMM segmentation and learning vector quantization

Author :

Katagiri, Shigeru ; Lee, Chin-Hui

Author_Institution :

Speech Res. Dept., AT&T Bell Labs., Murray Hill, NJ, USA

Volume :

Issue :

fYear :

1993

fDate :

10/1/1993 12:00:00 AM

Firstpage :

421

Lastpage :

430

Abstract :

A hybrid speech recognition algorithm based on the combination of hidden Markov models (HMMs) and learning vector quantization (LVQ) is presented. The LVQ training algorithms are capable of producing highly discriminative reference vectors for classifying static patterns, i.e., vectors with a fixed dimension. The HMM formulation has also been successfully applied to the recognition of dynamic speech patterns that are of variable duration. It is shown that by combining both LVQ´s discriminative power and the HMM´s capability of modeling temporal variations of speech in a hybrid algorithm, the performance of the original HMM-based speech recognizer is significantly improved. For a highly confusable vocabulary consisting of the nine American English E-set letters used in a multispeaker, isolated-word test mode, the average word accuracy of the baseline HMM recognizer is 67%. When LVQ is incorporated in the hybrid system, the word accuracy increases to 83%

Keywords :

hidden Markov models; speech recognition; vector quantisation; American English E-set letters; HMM segmentation; LVQ; average word accuracy; discriminative reference vectors; dynamic speech patterns; hidden Markov models; highly confusable vocabulary; hybrid algorithm; isolated-word test mode; learning vector quantization; speech recognition; static patterns classification; temporal variations; training algorithms; variable duration patterns; Algorithm design and analysis; Artificial neural networks; Helium; Hidden Markov models; Neural networks; Pattern recognition; Speech recognition; Testing; Vector quantization; Vocabulary;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.242488

Filename :

242488

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=968164