A fast approximate acoustic match for large vocabulary speech recognition

Author

Bahl, Lalit R. ; De Gennaro, Steven V. ; Gopalakrishnan, P.S. ; Mercer, Robert L.

Author_Institution

IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA

Volume

1

Issue

1

fYear

1993

fDate

1/1/1993 12:00:00 AM

Firstpage

59

Lastpage

67

Abstract

In a large vocabulary speech recognition system using hidden Markov models, calculating the likelihood of an acoustic signal segment for all the words in the vocabulary involves a large amount of computation. In order to run in real time on a modest amount of hardware, it is important that these detailed acoustic likelihood computations be performed only on words which have a reasonable probability of being the word that was spoken. The authors describe a scheme for rapidly obtaining an approximate acoustic match for all the words in the vocabulary in such a way as to ensure that the correct word is, with high probability, one of a small number of words examined in detail. Using fast search methods, they obtain a matching algorithm that is about a hundred times faster than doing a detailed acoustic likelihood computation on all the words in the IBM Office Correspondence isolated word dictation task, which has a vocabulary of 20000 words. Experimental results showing the effectiveness of such a fast match for a number of talkers are given

Keywords

acoustic signal processing; dictation; hidden Markov models; search problems; speech recognition; IBM Office Correspondence; acoustic likelihood computations; acoustic signal segment; approximate acoustic match; fast search methods; hidden Markov models; isolated word dictation task; large vocabulary speech recognition; matching algorithm; Acoustics; Arithmetic; Hardware; Hidden Markov models; Iterative decoding; Out of order; Search methods; Speech processing; Speech recognition; Vocabulary;

fLanguage

English

Journal_Title

Speech and Audio Processing, IEEE Transactions on

Publisher

ieee

ISSN

1063-6676

Type

jour

DOI

10.1109/89.221368

Filename

221368