Investigating translation of Parliament speeches

Author

Dechelotte, D. ; Schwenk, H. ; Gauvain, J.-L. ; Galibert, O. ; Lamel, L.

Author_Institution

LIMSI-CNRS, Orsay

fYear

2005

fDate

27-27 Nov. 2005

Firstpage

116

Lastpage

120

Abstract

This paper reports on recent experiments in speech-to-text (STT) translation of European Parliamentary speeches. A Spanish-speech-to-English-text translation system has been built using data from the TC-STAR European project. The speech recognizer is a state-of-the-art multipass system trained for the Spanish EPPS task, and the statistical translation system relies on the IBM-4 model. First, MT results are compared using manual transcriptions and 1-best ASR hypotheses with different word error rates. Then, an n-best interface between the ASR and MT components is investigated to improve the STT process. Derivation of the fundamental equation for machine translation suggests that the source language model is not necessary for STT. This was investigated by using weak source language models and by n-best rescoring that adds the acoustic model score only. A significant loss in the BLEU score was observed, suggesting that the source language model is needed given the insufficiencies of the translation model. Adding the source language model score in the n-best rescoring process recovers the loss and slightly improves the BLEU score over the 1-best ASR hypothesis. The system achieves a BLEU score of 37.3 with an ASR word error rate of 10% and a BLEU score of 40.5 using the manual transcripts.
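
The abstract's remark that the source language model drops out of the STT decision rule can be illustrated with the usual noisy-channel decomposition. The following is a sketch in assumed notation, not taken from this record: x denotes the acoustic observations, f a Spanish (source) word sequence, and e an English (target) word sequence. It assumes that x is conditionally independent of e given f, and that the sum over f is approximated by a maximum.

```latex
\begin{align*}
\hat{e} &= \arg\max_{e} P(e \mid x)
         = \arg\max_{e} P(e)\, P(x \mid e) \\
        &= \arg\max_{e} P(e) \sum_{f} P(f \mid e)\, P(x \mid f, e) \\
        &\approx \arg\max_{e} P(e)\, \max_{f} P(f \mid e)\, P(x \mid f)
\end{align*}
```

Under these assumptions only the target language model P(e), the translation model P(f | e), and the acoustic model P(x | f) appear; no source language model term P(f) is present. With an n-best interface, the maximization over f would be restricted to the ASR n-best list. Adding a source language model score back during rescoring, as the abstract describes, is then an extra knowledge source beyond this derivation.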

Keywords

language translation; natural languages; speech recognition; speech synthesis; BLEU score; European Parliamentary speeches; machine translation; multipass system; source language model; speech recognizer; speech to text translation; Automatic speech recognition; Equations; Error analysis; Humans; Statistics; Vocabulary

fLanguage

English

Publisher

IEEE

Conference_Titel

Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on

Conference_Location

San Juan

Print_ISBN

0-7803-9478-X

Electronic_ISBN

0-7803-9479-8

Type

conf

DOI

10.1109/ASRU.2005.1566514

Filename

1566514