Title :
A cache-based natural language model for speech recognition
Author :
Kuhn, Roland ; de Mori, Renato
Author_Institution :
Sch. of Comput. Sci., McGill Univ., Montreal, Que., Canada
fDate :
6/1/1990 12:00:00 AM
Abstract :
Speech-recognition systems must often decide between competing ways of breaking up the acoustic input into strings of words. Since the possible strings may be acoustically similar, a language model is required; given a word string, the model returns its linguistic probability. Several Markov language models are discussed. A novel kind of language model which reflects short-term patterns of word use by means of a cache component (analogous to cache memory in hardware terminology) is presented. The model also contains a 3g-gram component of the traditional type. The combined model and a pure 3g-gram model were tested on samples drawn from the Lancaster-Oslo/Bergen (LOB) corpus of English text. The relative performance of the two models is examined, and suggestions for the future improvements are made
Keywords :
Markov processes; natural languages; probability; speech recognition; English text; Lancaster-Oslo/Bergen; Markov language models; cache-based natural language model; linguistic probability; speech recognition; word string; Automatic speech recognition; Cache memory; Frequency estimation; Hardware; Magnetooptic recording; Natural languages; Probability; Speech recognition; Terminology; Testing; Vocabulary;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on