Regional Information Center for Science and Technology (RICeST) - Some insights from translating conversational telephone speech

DocumentCode :

178698

Title :

Some insights from translating conversational telephone speech

Author :

Kumar, Gaurav ; Post, Matt ; Povey, Daniel ; Khudanpur, Sanjeev

Author_Institution :

Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD, USA

Year :

2014

Date :

4-9 May 2014

Firstpage :

3231

Lastpage :

3235

Abstract :

We report insights from translating Spanish conversational telephone speech into English text by cascading an automatic speech recognition (ASR) system with a statistical machine translation (SMT) system. The key new insight is that the informal register of conversational speech is a greater challenge for ASR than for SMT: the BLEU score for translating the reference transcript is 64%, but drops to 32% for translating automatic transcripts, whose word error rate (WER) is 40%. Several strategies are examined to mitigate the impact of ASR errors on the SMT output: (i) providing the ASR lattice, instead of the 1-best output, as input to the SMT system, (ii) training the SMT system on Spanish ASR output paired with English text, instead of Spanish reference transcripts, and (iii) improving the core ASR system. Each leads to consistent and complementary improvements in the SMT output. Compared to translating the 1-best output of an ASR system with 40% WER using an SMT system trained on Spanish reference transcripts, translating the output lattice of a better ASR system with 35% WER using an SMT system trained on ASR output improves BLEU from 32% to 38%.
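The word error rate (WER) quoted in the abstract is conventionally the word-level edit distance between the ASR hypothesis and the reference transcript, divided by the number of reference words. The following is a minimal sketch of that standard definition, not the scoring tool used by the authors; the function name `word_error_rate` and the Spanish example sentences are illustrative assumptions and do not come from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Illustrative helper (hypothetical name); whitespace tokenization is an
    assumption, real scoring tools also normalize text before comparing.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Made-up example: one substitution and one deletion over five reference words.
    print(word_error_rate("hola como estas hoy amigo", "hola como esta hoy"))
```

In this made-up example, two errors over five reference words give a WER of 0.4, the same level as the 40% reported for the baseline ASR system.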

Keywords :

error analysis; language translation; speech recognition; ASR errors; ASR lattice; BLEU score; English text; SMT output; SMT system; Spanish conversational telephone speech translation; Spanish reference transcripts; WER; automatic speech recognition system; automatic transcripts; core ASR system; informal register; statistical machine translation system; word error rate; Acoustics; Conferences; Lattices; Speech; Speech processing; Speech recognition; Training; Human Language Technology; Machine Translation; Natural Language Processing; Speech Recognition; Spoken Language Translation;

Language :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

Conference_Location :

Florence

Type :

conf

DOI :

10.1109/ICASSP.2014.6854197

Filename :

6854197

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=178698