• DocumentCode
    1401393
  • Title

    Two decades of statistical language modeling: where do we go from here?

  • Author

    Rosenfeld, Ronald

  • Author_Institution
    Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • Volume
    88
  • Issue
    8
  • fYear
    2000
  • Firstpage
    1270
  • Lastpage
    1278
  • Abstract
    Statistical language models estimate the distribution of various natural language phenomena for the purpose of speech recognition and other language technologies. Since the first significant model was proposed in 1980, many attempts have been made to improve the state of the art. We review them, point to a few promising directions, and argue for a Bayesian approach to integration of linguistic theories with data.
  • Keywords
    Bayes methods; computational linguistics; natural languages; probability; reviews; Bayesian approach; linguistic theory; natural language processing; natural language technologies; probability; speech recognition; state of the art; statistical language modeling; Associate members; Bayesian methods; Information retrieval; Natural languages; Optical character recognition software; Paper technology; Probability distribution; Routing; Speech recognition; Training data;
  • fLanguage
    English
  • Journal_Title
    Proceedings of the IEEE
  • Publisher
    ieee
  • ISSN
    0018-9219
  • Type

    jour

  • DOI
    10.1109/5.880083
  • Filename
    880083