DocumentCode
1619042
Title
Discriminative reranking for SMT using various global features
Author
Goh, Chooi-Ling ; Watanabe, Taro ; Finch, Andrew ; Sumita, Eiichiro
Author_Institution
MASTAR Project, Nat. Inst. of Inf. & Commun. Technol., Keihanna Science City, Japan
fYear
2010
Firstpage
8
Lastpage
14
Abstract
In this paper, we propose to use various global features for discriminative reranking in an SMT framework. We employ an online large-margin based training algorithm for the structural output support vector machines based on the margin infused relaxed algorithm. Besides the standard features used, such as decoder´s scores, source and target sentences, alignments and part-of-speech tags, we include sentence type probabilities, posterior probabilities and back translation features for reranking. These features have been proved to be useful in other approaches in statistical machine translation but it is the first attempt to apply them in reranking. Our experimental results using 160K BTEC corpus show an improvement of 1-4 BLEU percentage points on Japanese/Chinese to English translation.
Keywords
language translation; learning (artificial intelligence); natural language processing; statistical analysis; support vector machines; 160K BTEC corpus; BLEU percentage points; back translation features; discriminative reranking; part-of-speech tags; posterior probabilities; sentence type probabilities; statistical machine translation; structural output support vector machines; training algorithm; Data models; Feature extraction; Hidden Markov models; Probability; Support vector machines; Training; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Universal Communication Symposium (IUCS), 2010 4th International
Conference_Location
Beijing
Print_ISBN
978-1-4244-7821-7
Type
conf
DOI
10.1109/IUCS.2010.5666776
Filename
5666776
Link To Document