• DocumentCode
    1858837
  • Title

    Inversion transduction grammar coverage of arabic-english word alignment for tree-structured statistical machine translation

  • Author

    Dekai Wu ; Carpuat, M. ; Yihai Shen

  • Author_Institution
    Dept. of Comput. Sci. & Eng., HKUST, Hong Kong
  • fYear
    2006
  • fDate
    10-13 Dec. 2006
  • Firstpage
    234
  • Lastpage
    237
  • Abstract
    We present the first known direct measurement of word alignment coverage on an Arabic-English parallel corpus using inversion transduction grammar constraints. While direct measurements have been reported for several European and Asian languages, to date no results have been available for Arabic or any Semitic language despite much recent activity on Arabic- English spoken language and text translation. Many recent syntax based statistical MT models operate within the domain of ITG expressiveness, often for efficiency reasons, so it has become important to determine the extent to which the ITG constraint assumption holds. Our results on Arabic provide further evidence that ITG expressiveness appears largely sufficient for core MT models.
  • Keywords
    grammars; language translation; natural language processing; text analysis; Arabic-English parallel corpus; Arabic-English word alignment; Semitic language; inversion transduction grammar coverage; text translation; transduction grammar constraints; tree-structured statistical machine translation; Computer science; Context modeling; Decoding; Error analysis; Formal languages; Hidden Markov models; Humans; Marine technology; Natural languages; Oral communication;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop, 2006. IEEE
  • Conference_Location
    Palm Beach
  • Print_ISBN
    1-4244-0872-5
  • Type

    conf

  • DOI
    10.1109/SLT.2006.326798
  • Filename
    4123405