• DocumentCode
    1858887
  • Title

    Reordering experiments for n-gram-based SMT

  • Author

    Crego, J.M. ; Marino, J.B.

  • Author_Institution
    TALP Res. Center, Univ. Politec. de Catalunya, Barcelona
  • fYear
    2006
  • fDate
    10-13 Dec. 2006
  • Firstpage
    242
  • Lastpage
    245
  • Abstract
    This paper addresses the problem of reordering in statistical machine translation (SMT). We describe an elegant and efficient approach to couple reordering (word order monotonization) and decoding, which does not need for any additional model. We use linguistically motivated reordering rules to extend a monotonic search graph (with reordering hypotheses). The extended graph is traversed in decoding when a fully- informed decision can be taken (no preprocessing decision about reordering is taken). We also show how the N-gram translation model can be successfully used as reordering model when estimated with reordered source words (to harmonize the source and target word order). Experiments are reported on the Euparl task (Spanish- to-English and English-to-Spanish). Results are presented regarding translation accuracy and computational efficiency, showing significant improvements in translation quality for both translation directions at a very low computational cost.
  • Keywords
    computational linguistics; decoding; graph theory; language translation; word processing; decoding; monotonic search graph; n-gram-based SMT; reordering rules; statistical machine translation; translation quality; word order monotonization; Computational efficiency; Decoding; Entropy; Equations; Frequency; Surface-mount technology; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop, 2006. IEEE
  • Conference_Location
    Palm Beach
  • Print_ISBN
    1-4244-0872-5
  • Type

    conf

  • DOI
    10.1109/SLT.2006.326800
  • Filename
    4123407