Title :
A tree-based statistical language model for natural language speech recognition
Author :
Bahl, Lalit R. ; Brown, Peter F. ; De Souza, Peter V. ; Mercer, Robert L.
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
fDate :
7/1/1989 12:00:00 AM
Abstract :
The problem of predicting the next word a speaker will say, given the words already spoken; is discussed. Specifically, the problem is to estimate the probability that a given word will be the next word uttered. Algorithms are presented for automatically constructing a binary decision tree designed to estimate these probabilities. At each node of the tree there is a yes/no question relating to the words already spoken, and at each leaf there is a probability distribution over the allowable vocabulary. Ideally, these nodal questions can take the form of arbitrarily complex Boolean expressions, but computationally cheaper alternatives are also discussed. Some results obtained on a 5000-word vocabulary with a tree designed to predict the next word spoken from the preceding 20 words are included. The tree is compared to an equivalent trigram model and shown to be superior
Keywords :
decision theory; speech recognition; Boolean expressions; binary decision tree; natural language speech recognition; probability distribution; tree-based statistical language model; trigram model; Algorithm design and analysis; Automatic speech recognition; Decision trees; Frequency estimation; Loudspeakers; Maximum likelihood estimation; Natural languages; Probability distribution; Speech recognition; Vocabulary;
Journal_Title :
Acoustics, Speech and Signal Processing, IEEE Transactions on