DocumentCode
2875675
Title
Investigating translation of Parliament speeches
Author
Dechelotte, D. ; Schwenk, H. ; Gauvain, J.-L. ; Galibert, O. ; Lamel, L.
Author_Institution
LIMSI-CNRS, Orsay
fYear
2005
fDate
27-27 Nov. 2005
Firstpage
116
Lastpage
120
Abstract
This paper reports on recent experiments for speech to text (STT) translation of European Parliamentary speeches. A Spanish speech to English text translation system has been built using data from the TC-STAR European project. The speech recognizer is a state-of-the-art multipass system trained for the Spanish EPPS task and the statistical translation system relies on the IBM-4 model. First, MT results are compared using manual transcriptions and 1-best ASR hypotheses with different word error rates. Then, a n-best interface between the ASR and MT components is investigated to improve the STT process. Derivation of the fundamental equation for machine translation suggests that the source language model is not necessary for STT. This was investigated by using weak source language models and by n-best rescoring adding the acoustic model score only. A significant loss in the BLEU score was observed suggesting that the source language model is needed given the insufficiencies of the translation model. Adding the source language model score in the n-best rescoring process recovers the loss and slightly improves the BLEU score over the 1-best ASR hypothesis. The system achieves a BLEU score of 37.3 with an ASR word error rate of 10% and a BLEU score of 40.5 using the manual transcripts
Keywords
language translation; natural languages; speech recognition; speech synthesis; BLEU score; European Parliamentary speeches; machine translation; multipass system; source language model; speech recognizer; speech to text translation; Automatic speech recognition; Equations; Error analysis; Humans; Natural languages; Speech recognition; Speech synthesis; Statistics; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
Conference_Location
San Juan
Print_ISBN
0-7803-9478-X
Electronic_ISBN
0-7803-9479-8
Type
conf
DOI
10.1109/ASRU.2005.1566514
Filename
1566514
Link To Document