• DocumentCode
    1606175
  • Title

    Chinese-uyghur statistical machine translation: The initial explorations

  • Author

    Dong, Xinghua ; Xue, Huajian ; Ma, Bo ; Wang, Lei

  • Author_Institution
    Xingjiang Tech. Inst. of Phys. & Chem., Chinese Acad. of China, Urumchi, China
  • fYear
    2010
  • Firstpage
    320
  • Lastpage
    324
  • Abstract
    In this paper, we present results of initial explorations to a phrase-based statistical machine translation system for a new language pair, namely Chinese-Uyghur. They are very different from each other, the characters of the former almost are hieroglyphics, morpheme processing don´t work at all, but the latter is an agglutinative language with very productive inflectional and derivational word-formation processes. To make them more similar, we reorder Chinese sentence structures from SVO to SOV and split Uyghur words into morphemes. The experiments show reordering Chinese sentence structure and properly splitting granularity for Uyghur can effectively improve the performances of translation system.
  • Keywords
    language translation; natural language processing; statistical analysis; Chinese sentence structure reordering; Chinese-Uyghur statistical machine translation; SOV; SVO; agglutinative language; derivational word-formation process; hieroglyphics; morpheme processing; phrase-based statistical machine translation system; Computational linguistics; Computational modeling; Conferences; Decoding; Government; Morphology; Training; Uyghur; morphemes; phrase-based; splitting granularity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Universal Communication Symposium (IUCS), 2010 4th International
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-7821-7
  • Type

    conf

  • DOI
    10.1109/IUCS.2010.5666183
  • Filename
    5666183