DocumentCode
3316939
Title
Language model adaptation based on the classification of a trigram´s language style feature
Author
Liang, Qi ; Zheng, Thomas Fang ; Xu, Mingxing ; Wu, Wenhu
Author_Institution
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
fYear
2005
fDate
30 Oct.-1 Nov. 2005
Firstpage
91
Lastpage
96
Abstract
In this paper, an adaptation method of the language style of a language model is proposed based on the differences between spoken and written language. Several interpolation methods based on trigram counts are used for adaptation. An interpolation method considering Katz smoothing computes weights according to the confidence score of a trigram. An adaptation method based on the classification of a trigram´s style feature computes weights dynamically according to the trigram´s language style tendency, and several weight generation functions are proposed. Experiments for spoken language on the Chinese corpora show that these methods, especially the method considering both a trigram´s confidence and style tendency, can achieve a reduction in the Chinese character error rate for pinyin-to-character conversion.
Keywords
computational linguistics; interpolation; natural languages; Chinese character error rate; Chinese corpora; interpolation methods; language model adaptation; pinyin-to-character conversion; spoken language; trigram language style feature classification; Adaptation model; Computer science; Concrete; Intelligent systems; Interpolation; Laboratories; Maximum likelihood estimation; Natural languages; Smoothing methods; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN
0-7803-9361-9
Type
conf
DOI
10.1109/NLPKE.2005.1598713
Filename
1598713
Link To Document