• DocumentCode
    672341
  • Title

    Improved punctuation recovery through combination of multiple speech streams

  • Author

    Miranda, Joao ; Neto, Joao Paulo ; Black, Alan W.

  • Author_Institution
    INESC-ID, Inst. Super. Tecnico, Lisbon, Portugal
  • fYear
    2013
  • fDate
    8-12 Dec. 2013
  • Firstpage
    132
  • Lastpage
    137
  • Abstract
    In this paper, we present a technique to use the information in multiple parallel speech streams, which are approximate translations of each other, in order to improve performance in a punctuation recovery task. We first build a phraselevel alignment of these multiple streams, using phrase tables to link the phrase pairs together. The information so collected is then used to make it more likely that sentence units are equivalent across streams. We applied this technique to a number of simultaneously interpreted speeches of the European Parliament Committees, for the recovery of the full stop, in four different languages (English, Italian, Portuguese and Spanish). We observed an average improvement in SER of 37% when compared to an existing baseline, in Portuguese and English.
  • Keywords
    language translation; natural language processing; speech processing; speech recognition; ASR; English language; European Parliament Committees; Italian language; Portuguese language; SER improvement; Spanish language; approximate translations; automatic speech recognition; full stop recovery; machine translation; multiple parallel speech streams; multiple speech stream combination; phrase pairs; phrase tables; phrase-level alignment; punctuation recovery; sentence units; simultaneously interpreted speeches; Entropy; Europe; Feature extraction; Lattices; Measurement; Speech; Speech recognition; combination; machine translation; multistream; punctuation; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
  • Conference_Location
    Olomouc
  • Type

    conf

  • DOI
    10.1109/ASRU.2013.6707718
  • Filename
    6707718