Title :
Template-based model for Mongolian-Chinese machine translation
Author :
Wu Jing;Hongxu Hou;Feilong Bao;Yupeng Jiang
Author_Institution :
College of Computer Science, Inner Mongolia University, Hohhot, China
Abstract :
Mongolian-Chinese statistical machine translation (SMT) systems are limited by the significant syntactic differences between the two languages, the complex morphology of Mongolian, and the scarcity of parallel corpora. Template-based machine translation (TBMT) can produce accurate, syntactically well-formed translations from a relatively small corpus, so combining SMT systems with a TBMT system can yield better translations. We built a Mongolian-Chinese TBMT system comprising a template extraction model and a template translation model. We propose a novel method that aligns and abstracts static words from bilingual parallel examples to extract templates automatically, and a method that filters out low-quality TBMT translations to improve the combined systems. We also apply lemmatization and latinization to address data sparsity and fuzzy matching. In our experiments, TBMT translations outperformed the phrase-based and hierarchical phrase-based SMT baselines, and the system combining TBMT with SMT also outperformed those baselines. The coverage of the TBMT system is also sufficient for use in the combined systems.
Keywords :
"Libraries","Dictionaries","Syntactics","Matched filters","Training","Feature extraction","Indexes"
Conference_Titel :
Technologies and Applications of Artificial Intelligence (TAAI), 2015 Conference on
Electronic_ISBN :
2376-6824
DOI :
10.1109/TAAI.2015.7407090