Title :
Inversion transduction grammar coverage of arabic-english word alignment for tree-structured statistical machine translation
Author :
Dekai Wu ; Carpuat, M. ; Yihai Shen
Author_Institution :
Dept. of Comput. Sci. & Eng., HKUST, Hong Kong
Abstract :
We present the first known direct measurement of word alignment coverage on an Arabic-English parallel corpus using inversion transduction grammar constraints. While direct measurements have been reported for several European and Asian languages, to date no results have been available for Arabic or any Semitic language despite much recent activity on Arabic- English spoken language and text translation. Many recent syntax based statistical MT models operate within the domain of ITG expressiveness, often for efficiency reasons, so it has become important to determine the extent to which the ITG constraint assumption holds. Our results on Arabic provide further evidence that ITG expressiveness appears largely sufficient for core MT models.
Keywords :
grammars; language translation; natural language processing; text analysis; Arabic-English parallel corpus; Arabic-English word alignment; Semitic language; inversion transduction grammar coverage; text translation; transduction grammar constraints; tree-structured statistical machine translation; Computer science; Context modeling; Decoding; Error analysis; Formal languages; Hidden Markov models; Humans; Marine technology; Natural languages; Oral communication;
Conference_Titel :
Spoken Language Technology Workshop, 2006. IEEE
Conference_Location :
Palm Beach
Print_ISBN :
1-4244-0872-5
DOI :
10.1109/SLT.2006.326798