DocumentCode :
1606175
Title :
Chinese-uyghur statistical machine translation: The initial explorations
Author :
Dong, Xinghua ; Xue, Huajian ; Ma, Bo ; Wang, Lei
Author_Institution :
Xingjiang Tech. Inst. of Phys. & Chem., Chinese Acad. of China, Urumchi, China
fYear :
2010
Firstpage :
320
Lastpage :
324
Abstract :
In this paper, we present results of initial explorations to a phrase-based statistical machine translation system for a new language pair, namely Chinese-Uyghur. They are very different from each other, the characters of the former almost are hieroglyphics, morpheme processing don´t work at all, but the latter is an agglutinative language with very productive inflectional and derivational word-formation processes. To make them more similar, we reorder Chinese sentence structures from SVO to SOV and split Uyghur words into morphemes. The experiments show reordering Chinese sentence structure and properly splitting granularity for Uyghur can effectively improve the performances of translation system.
Keywords :
language translation; natural language processing; statistical analysis; Chinese sentence structure reordering; Chinese-Uyghur statistical machine translation; SOV; SVO; agglutinative language; derivational word-formation process; hieroglyphics; morpheme processing; phrase-based statistical machine translation system; Computational linguistics; Computational modeling; Conferences; Decoding; Government; Morphology; Training; Uyghur; morphemes; phrase-based; splitting granularity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Universal Communication Symposium (IUCS), 2010 4th International
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-7821-7
Type :
conf
DOI :
10.1109/IUCS.2010.5666183
Filename :
5666183
Link To Document :
بازگشت