DocumentCode
2281943
Title
Employing topic modeling for statistical machine translation
Author
Zhengxian, Gong ; Guodong, Zhou
Author_Institution
Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China
Volume
4
fYear
2011
fDate
10-12 June 2011
Firstpage
24
Lastpage
28
Abstract
The mixture modeling approaches have dominated the research of domain adaptation in Statistical Machine Translation (SMT). Such approaches construct a general model and several sub-models in advance and focus on the way of determining the relative importance of all the models. In this paper, we propose a simple yet effective approach for better domain adaptation in phrase-based SMT via topic modeling. Different from existing approaches, our topic modeling approach employs one additional feature function to capture the topic inherent in the source phrase and help the decoder dynamically choose related target phrases according to the specific topic of the source phrase. Evaluation on a conversation corpus shows very encouraging results.
Keywords
language translation; statistical analysis; conversation corpus; mixture modeling approaches; phrase-based SMT; source phrase; statistical machine translation; topic modeling approach; Adaptation models; Analytical models; Business; Computational modeling; Decoding; Hidden Markov models; Training; domain adaptation; statistical macine translation; topic modeling;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Automation Engineering (CSAE), 2011 IEEE International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-8727-1
Type
conf
DOI
10.1109/CSAE.2011.5952796
Filename
5952796
Link To Document