• DocumentCode
    2875297
  • Title

    A new decoder for spoken language translation based on confusion networks

  • Author

    Bertoldi, Nicola ; Federico, Marcello

  • Author_Institution
    ITC, Centro per la Ricerca Sci. e Tecnologica, Trento
  • fYear
    2005
  • fDate
    27-27 Nov. 2005
  • Firstpage
    86
  • Lastpage
    91
  • Abstract
    A novel approach to spoken language translation is proposed, which more tightly integrates automatic speech recognition (ASR) and statistical machine translation (SMT). SMT is directly applied on an approximation of the word graph produced by the ASR system, namely a confusion network. The decoding algorithm extends a conventional phrase-based decoder in that it can process at once a large number of source sentence hypotheses contained in the confusion network. Experimental results are presented on a Spanish-English large vocabulary task, namely the translation of the European Parliament plenary sessions. With respect to a conventional SMT decoder processing N-best lists, a slight improvement in the BLEU score is reported as well as a significantly lower decoding time
  • Keywords
    decoding; language translation; natural languages; speech recognition; vocabulary; word processing; European Parliament plenary sessions; Spanish-English large vocabulary; automatic speech recognition; confusion network; decoding algorithm; source sentence hypotheses; spoken language translation; statistical machine translation; word graph; Acoustic transducers; Automatic speech recognition; Decoding; Entropy; Natural languages; Polynomials; Surface-mount technology; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
  • Conference_Location
    San Juan
  • Print_ISBN
    0-7803-9478-X
  • Electronic_ISBN
    0-7803-9479-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2005.1566492
  • Filename
    1566492