• DocumentCode
    2875675
  • Title

    Investigating translation of Parliament speeches

  • Author

    Dechelotte, D. ; Schwenk, H. ; Gauvain, J.-L. ; Galibert, O. ; Lamel, L.

  • Author_Institution
    LIMSI-CNRS, Orsay
  • fYear
    2005
  • fDate
    27-27 Nov. 2005
  • Firstpage
    116
  • Lastpage
    120
  • Abstract
    This paper reports on recent experiments for speech to text (STT) translation of European Parliamentary speeches. A Spanish speech to English text translation system has been built using data from the TC-STAR European project. The speech recognizer is a state-of-the-art multipass system trained for the Spanish EPPS task and the statistical translation system relies on the IBM-4 model. First, MT results are compared using manual transcriptions and 1-best ASR hypotheses with different word error rates. Then, a n-best interface between the ASR and MT components is investigated to improve the STT process. Derivation of the fundamental equation for machine translation suggests that the source language model is not necessary for STT. This was investigated by using weak source language models and by n-best rescoring adding the acoustic model score only. A significant loss in the BLEU score was observed suggesting that the source language model is needed given the insufficiencies of the translation model. Adding the source language model score in the n-best rescoring process recovers the loss and slightly improves the BLEU score over the 1-best ASR hypothesis. The system achieves a BLEU score of 37.3 with an ASR word error rate of 10% and a BLEU score of 40.5 using the manual transcripts
  • Keywords
    language translation; natural languages; speech recognition; speech synthesis; BLEU score; European Parliamentary speeches; machine translation; multipass system; source language model; speech recognizer; speech to text translation; Automatic speech recognition; Equations; Error analysis; Humans; Natural languages; Speech recognition; Speech synthesis; Statistics; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
  • Conference_Location
    San Juan
  • Print_ISBN
    0-7803-9478-X
  • Electronic_ISBN
    0-7803-9479-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2005.1566514
  • Filename
    1566514