DocumentCode
1401393
Title
Two decades of statistical language modeling: where do we go from here?
Author
Rosenfeld, Ronald
Author_Institution
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume
88
Issue
8
fYear
2000
Firstpage
1270
Lastpage
1278
Abstract
Statistical language models estimate the distribution of various natural language phenomena for the purpose of speech recognition and other language technologies. Since the first significant model was proposed in 1980, many attempts have been made to improve the state of the art. We review them, point to a few promising directions, and argue for a Bayesian approach to integration of linguistic theories with data.
Keywords
Bayes methods; computational linguistics; natural languages; probability; reviews; Bayesian approach; linguistic theory; natural language processing; natural language technologies; probability; speech recognition; state of the art; statistical language modeling; Associate members; Bayesian methods; Information retrieval; Natural languages; Optical character recognition software; Paper technology; Probability distribution; Routing; Speech recognition; Training data;
fLanguage
English
Journal_Title
Proceedings of the IEEE
Publisher
ieee
ISSN
0018-9219
Type
jour
DOI
10.1109/5.880083
Filename
880083
Link To Document