DocumentCode
498897
Title
English-Chinese OOV translation based on PAT Tree
Author
Wang, Yang ; Zhang, Yue-jie ; Zhang, Tao
Author_Institution
Shanghai Key Lab. of Intell. Inf. Process., Fudan Univ., Shanghai, China
Volume
3
fYear
2009
fDate
12-15 July 2009
Firstpage
1732
Lastpage
1736
Abstract
In Cross-Language Information Retrieval (CLIR) process, Out-Of-Vocabulary (OOV) or the unknown word translation is a significant and challenging issue. Specifically, for English-Chinese OOV translation, OOV term detection and extraction of translation pair still remain to be key problems. In this paper, an English-Chinese OOV translation pattern based on PAT-Tree is proposed. Web-mining is utilized as the corpus source to collect translation pairs, and translation candidates are acquired by Chinese OOV term extraction based on PAT-Tree. The experimental results show that the proposed approach can outperform some of the current translation engines, and is especially efficient in English-Chinese OOV translation.
Keywords
Internet; data mining; information retrieval; language translation; search engines; English-Chinese out-of-vocabulary translation; OOV term detection; OOV term extraction; Web-mining; cross-language information retrieval; translation engines; Cybernetics; Machine learning; Cross-Language Information Retrieval (CLIR); English-Chinese OOV translation; Out-of-Vocabulary (OOV); PAT-Tree; term extraction;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2009 International Conference on
Conference_Location
Baoding
Print_ISBN
978-1-4244-3702-3
Electronic_ISBN
978-1-4244-3703-0
Type
conf
DOI
10.1109/ICMLC.2009.5212280
Filename
5212280
Link To Document