Title :
Improved punctuation recovery through combination of multiple speech streams
Author :
Miranda, Joao ; Neto, Joao Paulo ; Black, Alan W.
Author_Institution :
INESC-ID, Inst. Super. Tecnico, Lisbon, Portugal
Abstract :
In this paper, we present a technique that uses the information in multiple parallel speech streams, which are approximate translations of each other, to improve performance in a punctuation recovery task. We first build a phrase-level alignment of these streams, using phrase tables to link the phrase pairs together. The information collected in this way is then used to make it more likely that sentence units are equivalent across streams. We applied this technique to a number of simultaneously interpreted speeches of the European Parliament Committees, for the recovery of the full stop, in four different languages (English, Italian, Portuguese and Spanish). We observed an average improvement in slot error rate (SER) of 37% over an existing baseline, in Portuguese and English.
Keywords :
language translation; natural language processing; speech processing; speech recognition; ASR; English language; European Parliament Committees; Italian language; Portuguese language; SER improvement; Spanish language; approximate translations; automatic speech recognition; full stop recovery; machine translation; multiple parallel speech streams; multiple speech stream combination; phrase pairs; phrase tables; phrase-level alignment; punctuation recovery; sentence units; simultaneously interpreted speeches; Entropy; Europe; Feature extraction; Lattices; Measurement; Speech; combination; multistream; punctuation
Conference_Title :
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
Conference_Location :
Olomouc
DOI :
10.1109/ASRU.2013.6707718