Title :
Multi-feature representation for Web-based English-Chinese OOV term translation
Author :
Zhang, Yue-jie ; Su, Yan-xia ; Jin, Cheng ; Zhang, Tao
Author_Institution :
Sch. of Comput. Sci., Fudan Univ., Shanghai, China
Abstract :
This paper focuses on the Web-based English-Chinese Out-of-Vocabulary (OOV) term translation pattern, and emphasizes particularly on the selection strategy based on the multi-feature representation for translation evaluation. Three kinds of feature, local feature, global feature and Boolean feature, are extracted from translation candidates based on the fusion strategy of multi-features. By utilizing the CoNLL 2003 corpus for the English Named Entity Recognition (NER) task, the related experiments based on such a standard data source show the promising results. The established multi-feature representation mechanism for English-Chinese OOV term translation model can “filter” the most possible translation candidate with better ability.
Keywords :
Internet; computational linguistics; feature extraction; language translation; pattern recognition; sensor fusion; vocabulary; CoNLL 2003 corpus; Web-based English-Chinese OOV term translation; local-global-Boolean feature extraction; multifeature representation; named entity recognition; out-of-vocabulary; selection strategy; standard data source; Accuracy; Cybernetics; Feature extraction; Machine learning; Semantics; Web mining; Boolean feature; Out-of-Vocabulary (OOV) term translation; global feature; local feature; multi-feature representation;
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2011 International Conference on
Conference_Location :
Guilin
Print_ISBN :
978-1-4577-0305-8
DOI :
10.1109/ICMLC.2011.6016971