DocumentCode
3317719
Title
Knowledge source construction in data-oriented English-Chinese machine translation
Author
Zhang, Yuejie ; Zhang, Tao
Author_Institution
Dept. of Comput. Sci. & Eng., Fudan Univ., Shanghai, China
fYear
2005
fDate
30 Oct.-1 Nov. 2005
Firstpage
404
Lastpage
409
Abstract
In data-oriented English-Chinese machine translation, the knowledge source is an essential basis for translation processing. This paper presents a construction strategy for a knowledge source that contains rich grammatical and syntactic information. First, taking lexical functional grammar as the theoretical basis, a treebank is acquired that contains a parse tree converted from every sentence in the source-language corpus. Second, a decomposition algorithm is used to construct the corresponding fragment bank, composed of all the legal fragments extracted from the treebank. Finally, a combination algorithm is used to build the fragment-combination bank, which includes all possible fragment-combination forms of every parse tree in the treebank. With the knowledge source constructed in this way, the whole machine translation process can be implemented efficiently and accurately.
Keywords
computational linguistics; grammars; language translation; natural languages; data-oriented English-Chinese machine translation; knowledge source construction; legal fragment extraction; parse trees; treebank; Computer science; Data engineering; Data mining; Finance; Humans; Knowledge engineering; Laboratories; Law; Legal factors; Tagging
fLanguage
English
Publisher
IEEE
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN
0-7803-9361-9
Type
conf
DOI
10.1109/NLPKE.2005.1598771
Filename
1598771