DocumentCode :
2066630
Title :
Word Alignment Based on Multi-Grain Model
Author :
He, Yanqing ; Zhou, Yu ; Zong, Chengqing
Author_Institution :
Inst. of Autom., Chinese Acad. of Sci., Beijing, China
fYear :
2008
fDate :
16-19 Dec. 2008
Firstpage :
1
Lastpage :
4
Abstract :
Word alignment plays a critical role in statistical machine translation (SMT) and cross-language information retrieval. Until now, most existing methods get the word alignment within the whole range of the sentence length. The alignment quality is unsatisfactory. In this paper, we propose a novel approach to word alignment based on multi-grain model (WAMG). We split a parallel sentence pair into blocks in different grain and get the word alignments within each corresponding block. Our approach is able to restrict the search space of word alignment in the relatively accurate local range and reduce the mapping error. The experiments have shown that our approach outperforms the traditional word alignment algorithm relatively by about 12% in AER and improves the performance of Chinese-to-English translation system relatively by about 2.8% in BLEU.
Keywords :
information retrieval; language translation; word processing; Chinese-to-English translation system; cross-language information retrieval; multi-grain model; statistical machine translation; word alignment; Automation; Entropy; Helium; Hidden Markov models; Information retrieval; Merging; Natural languages; Neural networks; Surface-mount technology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2942-4
Electronic_ISBN :
978-1-4244-2943-1
Type :
conf
DOI :
10.1109/CHINSL.2008.ECP.79
Filename :
4730333
Link To Document :
بازگشت