DocumentCode :
1858859
Title :
Syntactic phrase-based statistical machine translation
Author :
Hassan, H. ; Hearne, M. ; Way, A. ; Simaan, K.
Author_Institution :
Sch. of Comput., Dublin City Univ., Dublin
fYear :
2006
fDate :
10-13 Dec. 2006
Firstpage :
238
Lastpage :
241
Abstract :
Phrase-based statistical machine translation (PBSMT) systems represent the dominant approach in MT today. However, unlike systems in other paradigms, it has proven difficult to date to incorporate syntactic knowledge in order to improve translation quality. This paper improves on recent research which uses ´syntactified´ target language phrases, by incorporating supertags as constraints to better resolve parse tree fragments. In addition, we do not impose any sentence-length limit, and using a log-linear decoder, we outperform a state-of-the-art PBSMT system by over 1.3 BLEU points (or 3.51% relative) on the NIST 2003 Arabic-English test corpus.
Keywords :
decoding; grammars; language translation; log-linear decoder; parse tree fragments; syntactic knowledge; syntactic phrase-based statistical machine translation; translation quality; Data mining; Decoding; Hidden Markov models; NIST; Robustness; Statistics; Surface-mount technology; System testing; TV; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop, 2006. IEEE
Conference_Location :
Palm Beach
Print_ISBN :
1-4244-0872-5
Type :
conf
DOI :
10.1109/SLT.2006.326799
Filename :
4123406
Link To Document :
بازگشت