Title :
Using Translation Paraphrases from Trilingual Corpora to Improve Phrase-Based Statistical Machine Translation: A Preliminary Report
Author :
Herrera, Francisco Guzmán ; Luna, Leonardo Garrido
Author_Institution :
Centro de Sist. Inteligentes, Inst. Tecnol. de Monterrey, Monterrey
Abstract :
Statistical methods have proven to be very effective when addressing linguistic problems, specially when dealing with machine translation. Nevertheless, statistical machine translation effectiveness is limited to situations where large amounts of training data are available. Therefore, the broader the coverage of a SMT system is, the better the chances to get a reasonable output are. In this paper we propose a method to improve quality of translations of a phrase-based machine translation system by extending phrase-tables with the use of translation paraphrases learned from a third language. Our experiments were done translating from Spanish to English pivoting through French.
Keywords :
computational linguistics; language translation; linguistics; statistical analysis; Spanish to English translation; phrase-based statistical machine translation; translation paraphrases; trilingual corpora; Artificial intelligence; Decoding; Natural languages; Statistical analysis; Surface-mount technology; Training data; Paraphrases; Statistical Machine Translation; interlingua;
Conference_Titel :
Artificial Intelligence - Special Session, 2007. MICAI 2007. Sixth Mexican International Conference on
Conference_Location :
Aguascallentes
Print_ISBN :
978-0-7695-3124-3
DOI :
10.1109/MICAI.2007.34