• DocumentCode
    3558779
  • Title

    Integration of Speech Recognition and Machine Translation in Computer-Assisted Translation

  • Author

    Khadivi, Shahram ; Ney, Hermann

  • Author_Institution
    Dept. of Comput. Sci., RWTH Aachen Univ., Aachen
  • Volume
    16
  • Issue
    8
  • fYear
    2008
  • Firstpage
    1551
  • Lastpage
    1564
  • Abstract
    Parallel integration of automatic speech recognition (ASR) models and statistical machine translation (MT) models is an unexplored research area in comparison to the large amount of works done on integrating them in series, i.e., speech-to-speech translation. Parallel integration of these models is possible when we have access to the speech of a target language text and to its corresponding source language text, like a computer-assisted translation system. To our knowledge, only a few methods for integrating ASR models with MT models in parallel have been studied. In this paper, we systematically study a number of different translation models in the context of the N-best list rescoring. As an alternative to the N -best list rescoring, we use ASR word graphs in order to arrive at a tighter integration of ASR and MT models. The experiments are carried out on two tasks: English-to-German with an ASR vocabulary size of 17 K words, and Spanish-to-English with an ASR vocabulary of 58 K words. For the best method, the MT models reduce the ASR word error rate by a relative of 18% and 29% on the 17 K and the 58 K tasks, respectively.
  • Keywords
    computational linguistics; graph theory; integration; language translation; speech recognition; English-to-German; Spanish-to-English; automatic speech recognition; computer-assisted translation; machine translation; parallel integration; source language text; speech-to-speech translation; statistical machine translation models; word graphs; Automatic speech recognition; Concurrent computing; Context modeling; Engines; Error analysis; Helium; Humans; Natural languages; Speech recognition; Vocabulary; Computer-assisted translation (CAT); speech recognition; statistical machine translation (MT);
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2008.2004301
  • Filename
    4648933