DocumentCode :
3316939
Title :
Language model adaptation based on the classification of a trigram's language style feature
Author :
Liang, Qi ; Zheng, Thomas Fang ; Xu, Mingxing ; Wu, Wenhu
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
fYear :
2005
fDate :
30 Oct.-1 Nov. 2005
Firstpage :
91
Lastpage :
96
Abstract :
This paper proposes a method for adapting the language style of a language model, based on the differences between spoken and written language. Several interpolation methods based on trigram counts are used for adaptation. An interpolation method that takes Katz smoothing into account computes weights according to the confidence score of a trigram. An adaptation method based on the classification of a trigram's style feature computes weights dynamically according to the trigram's language style tendency, and several weight generation functions are proposed. Experiments on spoken language using Chinese corpora show that these methods, especially the method that considers both a trigram's confidence and its style tendency, reduce the Chinese character error rate in pinyin-to-character conversion.
Keywords :
computational linguistics; interpolation; natural languages; Chinese character error rate; Chinese corpora; interpolation methods; language model adaptation; pinyin-to-character conversion; spoken language; trigram language style feature classification; Adaptation model; Computer science; Concrete; Intelligent systems; Interpolation; Laboratories; Maximum likelihood estimation; Natural languages; Smoothing methods; Speech;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
Type :
conf
DOI :
10.1109/NLPKE.2005.1598713
Filename :
1598713