• DocumentCode
    3317719
  • Title
    Knowledge source construction in data-oriented English-Chinese machine translation
  • Author
    Zhang, Yuejie; Zhang, Tao
  • Author_Institution
    Dept. of Comput. Sci. & Eng., Fudan Univ., Shanghai, China
  • fYear
    2005
  • fDate
    30 Oct.-1 Nov. 2005
  • Firstpage
    404
  • Lastpage
    409
  • Abstract
    In data-oriented English-Chinese machine translation, the knowledge source is the essential basis for translation processing. This paper presents a construction strategy for a knowledge source that contains rich grammatical and syntactic information. First, taking Lexical Functional Grammar as the theoretical basis, a treebank is acquired that contains a parse tree for every sentence in the source-language corpus. Second, a decomposition algorithm is used to construct the corresponding fragment-bank, composed of all the legal fragments extracted from the treebank. Finally, a combination algorithm is used to build the fragment-combination-bank, which includes all the possible fragment-combination forms of every parse tree in the treebank. Once the knowledge source has been constructed, the whole machine translation process can be implemented efficiently and accurately.
  • Keywords
    computational linguistics; grammars; language translation; natural languages; data-oriented English-Chinese machine translation; knowledge source construction; legal fragment extraction; parse trees; treebank; Computer science; Data engineering; Data mining; Finance; Humans; Knowledge engineering; Laboratories; Law; Legal factors; Tagging
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
  • Print_ISBN
    0-7803-9361-9
  • Type
    conf
  • DOI
    10.1109/NLPKE.2005.1598771
  • Filename
    1598771