Title :
Lattice-based Viterbi decoding techniques for speech translation
Author :
Saon, George ; Picheny, Michael
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights
Abstract :
We describe a cardinality-synchronous Viterbi decoder for statistical phrase-based machine translation which can operate on general ASR lattices (as opposed to confusion networks). The decoder implements constrained source reordering on the input lattice and uses an outbound distortion model to score the possible reorderings. The phrase table, representing the decoding search space, is encoded as a weighted finite state acceptor which is determinized and minimized. At a high level, the search proceeds by performing simultaneous transitions in two pairs of automata: (input lattice, phrase table FSM) and (phrase table FSM, target language model). An alternative decoding strategy that we explore is to break the search into two independent subproblems: first, we perform monotone lattice decoding to find the best foreign path through the ASR lattice; then, we decode this path with reordering using standard sentence-based SMT. We report experimental results on several test sets of a large-scale Arabic-to-English speech translation task in the context of the DARPA Global Autonomous Language Exploitation (GALE) project. The results indicate that, for monotone search, lattice-based decoding outperforms 1-best decoding, whereas for search with reordering, only the second decoding strategy was found to be superior to 1-best decoding. In both cases, the improvements hold only for shallow lattices.
Keywords :
Viterbi decoding; automata theory; language translation; natural language interfaces; speech processing; statistical analysis; lattice-based Viterbi decoding technique; outbound distortion model; speech translation; standard sentence-based SMT; statistical phrase-based machine translation; weighted finite state acceptor; Automata; Automatic speech recognition; Decoding; Electronic mail; Error analysis; Large-scale systems; Lattices; Surface-mount technology; Testing; Viterbi algorithm
Conference_Title :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
DOI :
10.1109/ASRU.2007.4430143