• DocumentCode
    3317777
  • Title

    Maximum entropy based Chinese-Japanese word alignment

  • Author

    Guo, Hongmei ; Wang, Xiaojie

  • Author_Institution
    Sch. of Inf. Eng., Beijing Univ. of Posts & Telecommun., China
  • fYear
    2005
  • fDate
    30 Oct.-1 Nov. 2005
  • Firstpage
    437
  • Lastpage
    442
  • Abstract
    Word alignment can be used for numerous applications in natural language processing, such as lexicography, machine translation and so on. In this paper, we present an efficient context-dependent word alignment model based on maximum entropy (ME) approach. All knowledge sources are treated as feature functions in this model, such as source words, POS information and bilingual dictionary. This approach allows easy integration of more context-dependent information. We perform experiments on a Chinese-Japanese parallel corpus and the results are compared with a manually produced reference alignment. We achieve good alignment accuracy in a very noisy environment using unsupervised train method. And the effects of different features are also evaluated.
  • Keywords
    context-free grammars; language translation; maximum entropy methods; natural languages; Chinese-Japanese word alignment; POS information; bilingual dictionary; context-dependent information; lexicography; machine translation; maximum entropy approach; natural language processing; noisy environment; unsupervised train method; Context modeling; Dictionaries; Entropy; Machine learning; Natural language processing; Natural languages; Parameter estimation; Terminology; Testing; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
  • Print_ISBN
    0-7803-9361-9
  • Type

    conf

  • DOI
    10.1109/NLPKE.2005.1598777
  • Filename
    1598777