• DocumentCode
    3746210
  • Title
    Template-based model for Mongolian-Chinese machine translation
  • Author
    Wu Jing;Hongxu Hou;Feilong Bao;Yupeng Jiang
  • Author_Institution
    College of Computer Science, Inner Mongolia University, Hohhot, China
  • fYear
    2015
  • Firstpage
    352
  • Lastpage
    357
  • Abstract
    Mongolian-Chinese statistical machine translation (SMT) systems are limited by the significant syntactic differences between the two languages, the complex morphology of Mongolian, and the scarcity of parallel corpora. Template-based machine translation (TBMT) can produce accurate, syntactically well-formed translations from a relatively small corpus, so an SMT system can be combined with a TBMT system to obtain better translations. We built a Mongolian-Chinese TBMT system consisting of a template extraction model and a template translation model. We propose a novel method of aligning and abstracting static words from bilingual parallel examples to extract templates automatically. We also propose a method for filtering out low-quality TBMT translations to improve the combined system. Moreover, we apply lemmatization and Latinization to address data sparsity and fuzzy matching. In our experiments, the TBMT translations outperformed the phrase-based SMT and hierarchical phrase-based SMT baselines, and the system combining TBMT with SMT also outperformed the baselines. In addition, the coverage of the TBMT system is sufficient for the combined system.
  • Keywords
    "Libraries","Dictionaries","Syntactics","Matched filters","Training","Feature extraction","Indexes"
  • Publisher
    IEEE
  • Conference_Titel
    Technologies and Applications of Artificial Intelligence (TAAI), 2015 Conference on
  • Electronic_ISBN
    2376-6824
  • Type
    conf
  • DOI
    10.1109/TAAI.2015.7407090
  • Filename
    7407090