DocumentCode :
1955698
Title :
Hierarchical Pitman-Yor Language Model for Machine Translation
Author :
Okita, Tsuyoshi ; Way, Andy
Author_Institution :
Sch. of Comput., Dublin City Univ., Dublin, Ireland
fYear :
2010
fDate :
28-30 Dec. 2010
Firstpage :
245
Lastpage :
248
Abstract :
The hierarchical Pitman-Yor process-based smoothing method applied to language model was proposed by Gold water and by The, the performance of this smoothing method is shown comparable with the modified Kneser-Ney method in terms of perplexity. Although this method was presented four years ago, there has been no paper which reports that this language model indeed improves translation quality in the context of Machine Translation (MT). This is important for the MT community since an improvement in perplexity does not always lead to an improvement in BLEU score, for example, the success of word alignment measured by Alignment Error Rate (AER) does not often lead to an improvement in BLEU. This paper reports in the context of MT that an improvement in perplexity really leads to an improvement in BLEU score. It turned out that an application of the Hierarchical Pitman-Yor Language Model (HPYLM) requires a minor change in the conventional decoding process. Additionally to this, we propose a new Pitman-Yor process-based statistical smoothing method similar to the Good-Turing method although the performance of this is inferior to HPYLM. We conducted experiments, HPYLM improved by 1.03 BLEU points absolute and 6% relative for 50k EN-JP, which was statistically significant.
Keywords :
computational linguistics; language translation; smoothing methods; statistical analysis; BLEU score; Good-Turing method; Kneser-Ney smoothing method; alignment error rate; decoding process; hierarchical Pitman-Yor language model; hierarchical Pitman-Yor process based statistical smoothing method; machine translation; word alignment; Computational modeling; Context; Decoding; Equations; Joints; Mathematical model; Smoothing methods; Hierarchical Pitman-Yor process; Statistical Machine Translation; Statistical smoothing method;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Asian Language Processing (IALP), 2010 International Conference on
Conference_Location :
Harbin
Print_ISBN :
978-1-4244-9063-9
Type :
conf
DOI :
10.1109/IALP.2010.34
Filename :
5681619
Link To Document :
بازگشت