Title :
Minimizing search errors due to delayed bigrams in real-time speech recognition systems
Author :
Woszczyna, Monika ; Finke, M.
Author_Institution :
Interactive Syst. Lab., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
When building applications from large vocabulary speech recognition systems, a certain amount of search errors due to pruning often has to be accepted in order to obtain the required speed. We tackle the problems resulting from aggressive pruning strategies as typically applied in large vocabulary systems to achieve close to real-time performance. We consider a typical scenario of a two pass Viterbi search with the first pass being organized as a phoneme (allophone) tree. For such a tree organized lexicon, there are two possibilities to use a bigram language model: either by building tree copies or by using so-called delayed bigrams. Since copying trees turns out to be too expensive for real time applications we basically refer to delayed bigrams, discuss their drastic influence on the word accuracy and show how to alleviate the disastrous effect of delayed bigrams under aggressive pruning
Keywords :
delays; grammars; natural languages; real-time systems; speech recognition; tree searching; aggressive pruning; allophone tree; bigram language model; delayed bigrams; large vocabulary speech recognition systems; phoneme tree; real-time performance; real-time speech recognition systems; search errors minimisation; tree organized lexicon; two pass Viterbi search; word accuracy; Delay effects; Interactive systems; Laboratories; Law; Legal factors; Natural languages; Real time systems; Speech recognition; Viterbi algorithm; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
0-7803-3192-3
DOI :
10.1109/ICASSP.1996.540309