• DocumentCode
    1858859
  • Title

    Syntactic phrase-based statistical machine translation

  • Author

    Hassan, H. ; Hearne, M. ; Way, A. ; Simaan, K.

  • Author_Institution
    Sch. of Comput., Dublin City Univ., Dublin
  • fYear
    2006
  • fDate
    10-13 Dec. 2006
  • Firstpage
    238
  • Lastpage
    241
  • Abstract
    Phrase-based statistical machine translation (PBSMT) systems represent the dominant approach in MT today. However, unlike systems in other paradigms, it has proven difficult to date to incorporate syntactic knowledge in order to improve translation quality. This paper improves on recent research which uses ´syntactified´ target language phrases, by incorporating supertags as constraints to better resolve parse tree fragments. In addition, we do not impose any sentence-length limit, and using a log-linear decoder, we outperform a state-of-the-art PBSMT system by over 1.3 BLEU points (or 3.51% relative) on the NIST 2003 Arabic-English test corpus.
  • Keywords
    decoding; grammars; language translation; log-linear decoder; parse tree fragments; syntactic knowledge; syntactic phrase-based statistical machine translation; translation quality; Data mining; Decoding; Hidden Markov models; NIST; Robustness; Statistics; Surface-mount technology; System testing; TV; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop, 2006. IEEE
  • Conference_Location
    Palm Beach
  • Print_ISBN
    1-4244-0872-5
  • Type

    conf

  • DOI
    10.1109/SLT.2006.326799
  • Filename
    4123406