DocumentCode :
1512497
Title :
Fast likelihood computation techniques in nearest-neighbor based search for continuous speech recognition
Author :
Pellom, Bryan L. ; Sarikaya, Ruhi ; Hansen, John H L
Author_Institution :
Center for Speech & Language Res., Colorado Univ., Boulder, CO, USA
Volume :
8
Issue :
8
fYear :
2001
Firstpage :
221
Lastpage :
224
Abstract :
This paper describes two effective algorithms that reduce the computational complexity of state likelihood computation in mixture-based Gaussian speech recognition systems. We consider a baseline recognition system that uses nearest-neighbor search and partial distance elimination (PDE) to compute state likelihoods. The first algorithm exploits the high dependence exhibited among subsequent feature vectors to predict the best scoring mixture for each state. The method, termed best mixture prediction (BMP), leads to further speed improvement in the PDE technique. The second technique, termed feature component reordering (FCR), takes advantage of the variable contribution levels made to the final distortion score for each dimension of the feature and mean space vectors. The combination of two techniques with PDE reduces the computational time for likelihood computation by 29.8% over baseline likelihood computation. The algorithms are shown to yield the same accuracy level without further memory requirements for the November 1992 ARPA Wall Street Journal (WSJ) task.
Keywords :
computational complexity; maximum likelihood estimation; search problems; speech recognition; BMP; FCR; PDE; best mixture prediction; computational complexity; computational time; continuous speech recognition; fast likelihood computation techniques; feature component reordering; feature vectors; likelihood computation; mixture-based Gaussian speech recognition; nearest-neighbor based search; partial distance elimination; state likelihood computation; Binary search trees; Computational complexity; Covariance matrix; Distortion measurement; Hidden Markov models; Linear discriminant analysis; Natural languages; Nearest neighbor searches; Speech recognition; Vectors;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/97.935736
Filename :
935736
Link To Document :
بازگشت