DocumentCode :
2896086
Title :
Automatic Extraction of Chinese/Japanese Translation Patterns Using Prefix Span
Author :
Qian, Wang ; Komiya, Kanako ; Kotani, Yoshiyuki
Author_Institution :
Grad. Sch. of Eng., Tokyo Univ. of Agric. & Technol., Tokyo, Japan
fYear :
2011
fDate :
11-13 Nov. 2011
Firstpage :
139
Lastpage :
144
Abstract :
In late years, a large number of translation patterns are required for the pattern based machine translation. We propose an efficient method to extract the Japanese/Chinese translation patterns from the corpora using Prefix Span. They performed chunking on the sentence pairs of the parallel corpora, collected the candidate translation patterns from them using Prefix Span, and narrow down the candidates using two criteria: the point wise mutual information (PMI) and the degree of confidence for the threshold values. The proposed method achieved precision 85% when the PMI is 1.0 and the degree of confidence is 0.15.
Keywords :
language translation; natural language processing; Chinese-Japanese translation pattern automatic extraction; candidate translation patterns; parallel corpora; pattern based machine translation; point wise mutual information; prefix span; sentence pair chunking; threshold values confidence degree; Artificial intelligence; Chinese; Japanese; Prefix Span; translation pattern;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Technologies and Applications of Artificial Intelligence (TAAI), 2011 International Conference on
Conference_Location :
Chung-Li
Print_ISBN :
978-1-4577-2174-8
Type :
conf
DOI :
10.1109/TAAI.2011.31
Filename :
6120733
Link To Document :
بازگشت