DocumentCode
2998662
Title
Three probabilistic language models for a large-vocabulary speech recognizer
Author
Dumouchel, P. ; Gupta, V. ; Lennig, M. ; Mermelstein, P.
Author_Institution
INRS-Telecommun., Montreal, Que., Canada
fYear
1988
fDate
11-14 Apr 1988
Firstpage
513
Abstract
Relative performance is compared for three different language models applied to the linguistic decoding part of a 75000-word speech recognizer. These models are the trigram model, the tri-POS model (POS stands for parts of speech), and a smoothed trigram model with tied distributions for words three or more syllables long. The full trigram model gives the best performance but is most expensive in terms of data and storage requirements. The smoothed trigram and tri-POS models yield equivalent performance. For general text entry tasks, use of the tri-POS model is suggested since it is less sensitive to variations in the discourse domains. For applications specific to individual discourse domains, trigram models trained on data obtained from the target domain are recommended
Keywords
natural languages; speech recognition; discourse domains; large-vocabulary speech recognizer; linguistic decoding; parts of speech; probabilistic language models; smoothed trigram model; target domain; text entry; tri-POS model; trigram model; Acoustical engineering; Councils; Databases; Decoding; Frequency; Natural languages; Performance evaluation; Speech recognition; Testing; Text recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on
Conference_Location
New York, NY
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.1988.196632
Filename
196632
Link To Document