DocumentCode
1606175
Title
Chinese-uyghur statistical machine translation: The initial explorations
Author
Dong, Xinghua ; Xue, Huajian ; Ma, Bo ; Wang, Lei
Author_Institution
Xingjiang Tech. Inst. of Phys. & Chem., Chinese Acad. of China, Urumchi, China
fYear
2010
Firstpage
320
Lastpage
324
Abstract
In this paper, we present results of initial explorations to a phrase-based statistical machine translation system for a new language pair, namely Chinese-Uyghur. They are very different from each other, the characters of the former almost are hieroglyphics, morpheme processing don´t work at all, but the latter is an agglutinative language with very productive inflectional and derivational word-formation processes. To make them more similar, we reorder Chinese sentence structures from SVO to SOV and split Uyghur words into morphemes. The experiments show reordering Chinese sentence structure and properly splitting granularity for Uyghur can effectively improve the performances of translation system.
Keywords
language translation; natural language processing; statistical analysis; Chinese sentence structure reordering; Chinese-Uyghur statistical machine translation; SOV; SVO; agglutinative language; derivational word-formation process; hieroglyphics; morpheme processing; phrase-based statistical machine translation system; Computational linguistics; Computational modeling; Conferences; Decoding; Government; Morphology; Training; Uyghur; morphemes; phrase-based; splitting granularity;
fLanguage
English
Publisher
ieee
Conference_Titel
Universal Communication Symposium (IUCS), 2010 4th International
Conference_Location
Beijing
Print_ISBN
978-1-4244-7821-7
Type
conf
DOI
10.1109/IUCS.2010.5666183
Filename
5666183
Link To Document