Title :
Non-deterministic stochastic language models for speech recognition
Author :
Riccardi, G. ; Bocchieri, E. ; Pieraccini, R.
Author_Institution :
Dept. of Speech Res., AT&T Bell Labs., Murray Hill, NJ, USA
Abstract :
Traditional stochastic language models for speech recognition (i.e. n-grams) are deterministic, in the sense that there is one and only one derivation for each given sentence. Moreover a fixed temporal window is always assumed in the estimation of the traditional stochastic language models. This paper shows how non-determinism is introduced to effectively approximate a back-off n-gram language model through a finite state network formalism. It also shows that a new flexible and powerful network formalization can be obtained by releasing the assumption of a fixed history size. As a result, a class of automata for language modeling (variable n-gram stochastic automata) is obtained, for which we propose some methods for the estimation of the transition probabilities. VNSAs have been used in a spontaneous speech recognizer for the ATIS task. The accuracy on a standard test set is presented
Keywords :
estimation theory; finite automata; grammars; natural languages; probability; speech processing; speech recognition; stochastic automata; ATIS task; back-off n-gram language model; finite state network formalism; fixed temporal window; language modeling automata; nondeterministic stochastic language models; recognition accuracy; speech recognition; spontaneous speech recognizer; standard test set; stochastic language models estimation; transition probabilities; variable n-gram stochastic automata; Automata; Automatic speech recognition; Maximum likelihood decoding; Maximum likelihood estimation; Natural languages; Speech recognition; Stochastic processes; Testing; Training data; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479408