• DocumentCode
    3489854
  • Title

    Improving Word Alignment for Statistical Machine Translation Based on Constraints

  • Author

    Le Quang-Hung ; Le Anh-Cuong

  • Author_Institution
    Fac. of Inf. Technol., Quynhon Univ., Quy Nhon, Vietnam
  • fYear
    2012
  • fDate
    13-15 Nov. 2012
  • Firstpage
    113
  • Lastpage
    116
  • Abstract
    Word alignment is an important and fundamental task for building a statistical machine translation (SMT) system. However, obtaining word-level alignments in parallel corpora with high accuracy is still a challenge. In this paper, we propose a new method, which is based on constraint approach, to improve the quality of word alignment. Our experiments show that using constraints for the parameter estimation of the IBM models reduces the alignment error rate down to 7.26% and increases the BLEU score to 5%, in the case of translation from English to Vietnamese.
  • Keywords
    language translation; natural language processing; statistical analysis; BLEU score; English; IBM models; SMT system; Vietnamese; alignment error rate reduction; parallel corpora; parameter estimation; statistical machine translation system; word alignment improvement; word-level alignments; Computational linguistics; Computational modeling; Dictionaries; Error analysis; Parameter estimation; Training; Training data; statistical machine translation; word alignment;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Asian Language Processing (IALP), 2012 International Conference on
  • Conference_Location
    Hanoi
  • Print_ISBN
    978-1-4673-6113-2
  • Electronic_ISBN
    978-0-7695-4886-9
  • Type

    conf

  • DOI
    10.1109/IALP.2012.45
  • Filename
    6473709