Title :
Syntactic phrase-based statistical machine translation
Author :
Hassan, H. ; Hearne, M. ; Way, A. ; Simaan, K.
Author_Institution :
Sch. of Comput., Dublin City Univ., Dublin
Abstract :
Phrase-based statistical machine translation (PBSMT) systems represent the dominant approach in MT today. However, unlike systems in other paradigms, it has proven difficult to date to incorporate syntactic knowledge in order to improve translation quality. This paper improves on recent research which uses ´syntactified´ target language phrases, by incorporating supertags as constraints to better resolve parse tree fragments. In addition, we do not impose any sentence-length limit, and using a log-linear decoder, we outperform a state-of-the-art PBSMT system by over 1.3 BLEU points (or 3.51% relative) on the NIST 2003 Arabic-English test corpus.
Keywords :
decoding; grammars; language translation; log-linear decoder; parse tree fragments; syntactic knowledge; syntactic phrase-based statistical machine translation; translation quality; Data mining; Decoding; Hidden Markov models; NIST; Robustness; Statistics; Surface-mount technology; System testing; TV; Vocabulary;
Conference_Titel :
Spoken Language Technology Workshop, 2006. IEEE
Conference_Location :
Palm Beach
Print_ISBN :
1-4244-0872-5
DOI :
10.1109/SLT.2006.326799