DocumentCode :
2267580
Title :
A novel approach to sentence alignment from comparable corpora
Author :
Li, Min-Hsiang ; Klyuev, Vitaly ; Wu, Shih-Hung
Author_Institution :
Software Eng. Lab., Univ. of Aizu, Aizuwakamatsu, Japan
Volume :
2
fYear :
2011
fDate :
15-17 Sept. 2011
Firstpage :
618
Lastpage :
623
Abstract :
This paper introduces a new technique to select candidate sentences for alignment from bilingual comparable corpora. Tests were done utilizing Wikipedia as a source for bilingual data. Our test languages are English and Chinese. A high quality of sentence alignment is illustrated by a machine translation application.
Keywords :
language translation; text analysis; Chinese language; English language; Wikipedia; bilingual comparable corpora; bilingual data; machine translation; sentence alignment; Electronic publishing; Encyclopedias; Google; Information retrieval; Internet; Probability; allignment; corpus; information retrieval; text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Data Acquisition and Advanced Computing Systems (IDAACS), 2011 IEEE 6th International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-1426-9
Type :
conf
DOI :
10.1109/IDAACS.2011.6072842
Filename :
6072842
Link To Document :
بازگشت