Title :
Lattice-Based ASR-MT Interface for Speech Translation
Author :
Matusov, Evgeny ; Ney, Hermann
Author_Institution :
Comput. Sci. Dept., RWTH Aachen Univ., Aachen, Germany
fDate :
5/1/2011 12:00:00 AM
Abstract :
The usual approach to improve the interface between automatic speech recognition (ASR) and machine translation (MT) is to use ASR word lattices for translation. In comparison with the previous research along this line, this paper presents an efficient algorithm for lattice-based search in MT. This algorithm utilizes confusion network information to enable phrase-level reordering, and is also able to process general lattices. The proposed search is not constrained to be monotonic; thus, it is able to perform the same type of reordering given lattice input as any statistical phrase-based search algorithm with a single sentence input. Using the concept described in this paper, we are able to significantly improve speech translation results on several small and large vocabulary tasks. The improvements of the MT quality as measured by BLEU are as high as 5% relative. We also show that the proposed lattice-based translation can outperform state-of-the-art translation of confusion networks and has advantages in terms of translation speed. Furthermore, we propose and evaluate a novel approach that shares the benefits of lattice-based translation with those translation systems which are not designed to process word lattices.
Keywords :
language translation; search problems; speech recognition; vocabulary; ASR word lattices; MT quality; automatic speech recognition; confusion network information; large vocabulary task; lattice-based ASR-MT interface; lattice-based search; machine translation; phrase-level reordering; speech translation; statistical phrase-based search algorithm; Machine translation (MT); SLP-SSMT; natural languages; speech processing;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2010.2060483