DocumentCode
3746210
Title
Template-based model for Mongolian - Chinese machine translation
Author
Wu Jing;Hongxu Hou;Feilong Bao;Yupeng Jiang
Author_Institution
College of Computer Science, Inner Mongolia University, Hohhot, China
fYear
2015
Firstpage
352
Lastpage
357
Abstract
Mongolian-Chinese statistical machine translation (SMT) system has its limitation because of the significant syntax differences, complex Mongolian morphology and scarce resource of parallel corpus. Template-based machine translation (TBMT) can produce high accuracy and good syntax structure translations with a relative small corpus. Therefore, SMT systems can combine with TBMT system to get better translations. We built a Mongolian-Chinese TBMT system including a template extraction model and a template translation model. We proposed a novel method of aligning and abstracting static words from bilingual parallel examples to extracts templates automatically. We also proposed a method to filter out low quality TBMT translations to enhance the combined systems. Moreover, we applied lemmatization and latinization to address the problem data sparsity and fuzzy match. Experimentally, the translation of TBMT outperformed the baselines of phrase-based SMT system and hierarchical phrase-based SMT system. The combined system of the TBMT and the SMT systems also performed better than the baselines. Besides, the coverage can satisfy the combined systems.
Keywords
"Libraries","Dictionaries","Syntactics","Matched filters","Training","Feature extraction","Indexes"
Publisher
ieee
Conference_Titel
Technologies and Applications of Artificial Intelligence (TAAI), 2015 Conference on
Electronic_ISBN
2376-6824
Type
conf
DOI
10.1109/TAAI.2015.7407090
Filename
7407090
Link To Document