• DocumentCode
    3316939
  • Title

    Language model adaptation based on the classification of a trigram's language style feature

  • Author

    Liang, Qi ; Zheng, Thomas Fang ; Xu, Mingxing ; Wu, Wenhu

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
  • fYear
    2005
  • fDate
    30 Oct.-1 Nov. 2005
  • Firstpage
    91
  • Lastpage
    96
  • Abstract
    In this paper, an adaptation method of the language style of a language model is proposed based on the differences between spoken and written language. Several interpolation methods based on trigram counts are used for adaptation. An interpolation method considering Katz smoothing computes weights according to the confidence score of a trigram. An adaptation method based on the classification of a trigram's style feature computes weights dynamically according to the trigram's language style tendency, and several weight generation functions are proposed. Experiments for spoken language on the Chinese corpora show that these methods, especially the method considering both a trigram's confidence and style tendency, can achieve a reduction in the Chinese character error rate for pinyin-to-character conversion.
  • Keywords
    computational linguistics; interpolation; natural languages; Chinese character error rate; Chinese corpora; interpolation methods; language model adaptation; pinyin-to-character conversion; spoken language; trigram language style feature classification; Adaptation model; Computer science; Concrete; Intelligent systems; Interpolation; Laboratories; Maximum likelihood estimation; Natural languages; Smoothing methods; Speech;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
  • Print_ISBN
    0-7803-9361-9
  • Type

    conf

  • DOI
    10.1109/NLPKE.2005.1598713
  • Filename
    1598713