مرکز منطقه ای اطلاع رساني علوم و فناوري - State-based Gaussian selection in large vocabulary continuous speech recognition using HMMs

DocumentCode :

1476455

Title :

State-based Gaussian selection in large vocabulary continuous speech recognition using HMMs

Author :

Gales, Mark J F ; Knill, Katherine M. ; Young, Stephen J.

Volume :

Issue :

fYear :

1999

fDate :

3/1/1999 12:00:00 AM

Firstpage :

152

Lastpage :

161

Abstract :

This paper investigates the use of Gaussian selection (GS) to increase the speed of a large vocabulary speech recognition system. Typically, 30-70% of the computational time of a continuous density hidden Markov model-based (HMM-based) speech recognizer is spent calculating probabilities. The aim of CS is to reduce this load by selecting the subset of Gaussian component likelihoods that should be computed given a particular input vector. This paper examines new techniques for obtaining “good” Gaussian subsets or “shortlists.” All the new schemes make use of state information, specifically, to which state each of the Gaussian components belongs. In this way, a maximum number of Gaussian components per state may be specified, hence reducing the size of the shortlist. The first technique introduced is a simple extension of the standard GS method, which uses this state information. Then, more complex schemes based on maximizing the likelihood of the training data are proposed. These new approaches are compared with the standard GS scheme on a large vocabulary speech recognition task. On this task, the use of state information reduced the percentage of Gaussians computed to 10-15%, compared with 20-30% for the standard GS scheme, with little degradation in performance

Keywords :

Gaussian processes; hidden Markov models; speech coding; speech recognition; vector quantisation; Gaussian component likelihoods; Gaussian components; HMM; VQ; codewords; computational time; continuous density hidden Markov model; input vector; large vocabulary continuous speech recognition; probabilities; shortlists; state information; state-based Gaussian selection; training data; Decoding; Degradation; Hidden Markov models; Linear discriminant analysis; Probability; Real time systems; Speech recognition; Training data; Vectors; Vocabulary;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.748120

Filename :

748120

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1476455