DocumentCode
1858859
Title
Syntactic phrase-based statistical machine translation
Author
Hassan, H. ; Hearne, M. ; Way, A. ; Simaan, K.
Author_Institution
Sch. of Comput., Dublin City Univ., Dublin
fYear
2006
fDate
10-13 Dec. 2006
Firstpage
238
Lastpage
241
Abstract
Phrase-based statistical machine translation (PBSMT) systems represent the dominant approach in MT today. However, unlike systems in other paradigms, it has proven difficult to date to incorporate syntactic knowledge in order to improve translation quality. This paper improves on recent research which uses ´syntactified´ target language phrases, by incorporating supertags as constraints to better resolve parse tree fragments. In addition, we do not impose any sentence-length limit, and using a log-linear decoder, we outperform a state-of-the-art PBSMT system by over 1.3 BLEU points (or 3.51% relative) on the NIST 2003 Arabic-English test corpus.
Keywords
decoding; grammars; language translation; log-linear decoder; parse tree fragments; syntactic knowledge; syntactic phrase-based statistical machine translation; translation quality; Data mining; Decoding; Hidden Markov models; NIST; Robustness; Statistics; Surface-mount technology; System testing; TV; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language Technology Workshop, 2006. IEEE
Conference_Location
Palm Beach
Print_ISBN
1-4244-0872-5
Type
conf
DOI
10.1109/SLT.2006.326799
Filename
4123406
Link To Document