• DocumentCode
    2379115
  • Title

    Reordering pPhrase-based machine translation over chunks

  • Author

    Van Nguyen, V. ; Thai Phuong Nguyen ; Shimazu, Akira ; Minh Le Nguyen

  • Author_Institution
    Japan Adv. Inst. of Sci. & Technol., Ishikawa
  • fYear
    2008
  • fDate
    13-17 July 2008
  • Firstpage
    114
  • Lastpage
    119
  • Abstract
    The paper presents a new method for reordering in phrase based statistical machine translation (PBMT). Our method is based on previous chunk-level reordering methods for PBMT. First, we parse the source language sentence to a chunk tree, according to the method developed by [16]. Second, we apply a series of transformation rules which are learnt automatically from the parallel corpus to the chunk tree over chunk level. Finally, we integrate a global reordering model directly in a decoder as a graph of phrases, and solve the overlapping phrase and chunk problem. The experimental results with English-Vietnamese pairs show that our method outperforms the baseline PBMT in both accuracy and speed.
  • Keywords
    language translation; statistical analysis; trees (mathematics); English-Vietnamese pairs; chunk tree; chunk-level reordering methods; global reordering model; phrase-based machine translation reordering; source language sentence; statistical machine translation; Buildings; Data preprocessing; Decoding; Entropy; Morphology; Robustness; Surface-mount technology; Tree data structures; Tree graphs;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Research, Innovation and Vision for the Future, 2008. RIVF 2008. IEEE International Conference on
  • Conference_Location
    Ho Chi Minh City
  • Print_ISBN
    978-1-4244-2379-8
  • Electronic_ISBN
    978-1-4244-2380-4
  • Type

    conf

  • DOI
    10.1109/RIVF.2008.4586342
  • Filename
    4586342