Title :
Discriminative reranking for SMT using various global features
Author :
Goh, Chooi-Ling ; Watanabe, Taro ; Finch, Andrew ; Sumita, Eiichiro
Author_Institution :
MASTAR Project, Nat. Inst. of Inf. & Commun. Technol., Keihanna Science City, Japan
Abstract :
In this paper, we propose to use various global features for discriminative reranking in an SMT framework. We employ an online large-margin based training algorithm for the structural output support vector machines based on the margin infused relaxed algorithm. Besides the standard features used, such as decoder´s scores, source and target sentences, alignments and part-of-speech tags, we include sentence type probabilities, posterior probabilities and back translation features for reranking. These features have been proved to be useful in other approaches in statistical machine translation but it is the first attempt to apply them in reranking. Our experimental results using 160K BTEC corpus show an improvement of 1-4 BLEU percentage points on Japanese/Chinese to English translation.
Keywords :
language translation; learning (artificial intelligence); natural language processing; statistical analysis; support vector machines; 160K BTEC corpus; BLEU percentage points; back translation features; discriminative reranking; part-of-speech tags; posterior probabilities; sentence type probabilities; statistical machine translation; structural output support vector machines; training algorithm; Data models; Feature extraction; Hidden Markov models; Probability; Support vector machines; Training; Training data;
Conference_Titel :
Universal Communication Symposium (IUCS), 2010 4th International
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-7821-7
DOI :
10.1109/IUCS.2010.5666776