DocumentCode
1858887
Title
Reordering experiments for n-gram-based SMT
Author
Crego, J.M. ; Marino, J.B.
Author_Institution
TALP Res. Center, Univ. Politec. de Catalunya, Barcelona
fYear
2006
fDate
10-13 Dec. 2006
Firstpage
242
Lastpage
245
Abstract
This paper addresses the problem of reordering in statistical machine translation (SMT). We describe an elegant and efficient approach to couple reordering (word order monotonization) and decoding, which does not need for any additional model. We use linguistically motivated reordering rules to extend a monotonic search graph (with reordering hypotheses). The extended graph is traversed in decoding when a fully- informed decision can be taken (no preprocessing decision about reordering is taken). We also show how the N-gram translation model can be successfully used as reordering model when estimated with reordered source words (to harmonize the source and target word order). Experiments are reported on the Euparl task (Spanish- to-English and English-to-Spanish). Results are presented regarding translation accuracy and computational efficiency, showing significant improvements in translation quality for both translation directions at a very low computational cost.
Keywords
computational linguistics; decoding; graph theory; language translation; word processing; decoding; monotonic search graph; n-gram-based SMT; reordering rules; statistical machine translation; translation quality; word order monotonization; Computational efficiency; Decoding; Entropy; Equations; Frequency; Surface-mount technology; System testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language Technology Workshop, 2006. IEEE
Conference_Location
Palm Beach
Print_ISBN
1-4244-0872-5
Type
conf
DOI
10.1109/SLT.2006.326800
Filename
4123407
Link To Document