DocumentCode :
3585005
Title :
Incremental translation using hierarchichal phrase-based translation system
Author :
Siahbani, Maryam ; Seraj, Ramtin Mehdizadeh ; Sankaran, Baskaran ; Sarkar, Anoop
Author_Institution :
Sch. of Comput. Sci., Simon Fraser Univ., Burnaby, BC, Canada
fYear :
2014
Firstpage :
71
Lastpage :
76
Abstract :
Hierarchical phrase-based machine translation [1] (Hiero) is a prominent approach for Statistical Machine Translation usually comparable to or better than conventional phrase-based systems. But Hiero typically uses the CKY decoding algorithm which requires the entire input sentence before decoding begins, as it produces the translation in a bottom-up fashion. Left-to-right (LR) decoding [2] is a promising decoding algorithm for Hiero that produces the output translation in left to right order. In this paper we focus on simultaneous translation using the Hiero translation framework. In simultaneous translation, translations are generated incrementally as source language speech input is processed. We propose a novel approach for incremental translation by integrating segmentation and decoding in LR-Hiero. We compare two incremental decoding algorithms for LR-Hiero and present translation quality scores (BLEU) and the latency of generating translations for both decoders on audio lectures from the TED collection.
Keywords :
language translation; natural language processing; speech coding; statistical analysis; BLEU; CKY decoding algorithm; Hiero translation framework; LR decoding; LR-Hiero; TED collection; audio lectures; hierarchical phrase-based machine translation system; incremental decoding algorithm; incremental translation; left-to-right decoding; segmentation; source language speech input; statistical machine translation; translation quality scores; Abstracts; Decoding; Feature extraction; History; Joints; Training; Hierarchical Phrase-based Translation (Hiero); Incremental Decoding; Left-to-Right Decoding; Statistical Machine Translation (SMT);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2014 IEEE
Type :
conf
DOI :
10.1109/SLT.2014.7078552
Filename :
7078552
Link To Document :
بازگشت