• DocumentCode
    177442
  • Title

    Reranked aligners for interactive transcript correction

  • Author

    Favre, Benoit ; Rouvier, Mickael ; Bechet, Frederic

  • Author_Institution
    Aix-Marseille Univ., Marseille, France
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    146
  • Lastpage
    150
  • Abstract
    Clarification dialogs can help address ASR errors in speech-to-speech translation systems and other interactive applications. We propose to use variants of Levenshtein alignment for merging an er-rorful utterance with a targeted rephrase of an error segment. ASR errors that might harm the alignment are addressed through phonetic matching, and a word embedding distance is used to account for the use of synonyms outside targeted segments. These features lead to a relative improvement of 30% of word error rate on sentences with ASR errors compared to not performing the clarification. Twice as many utterances are completely corrected compared to using basic word alignment. Furthermore, we generate a set of potential merges and train a neural network on crowd-sourced rephrases in order to select the best merger, leading to 24% more instances completely corrected. The system is deployed in the framework of the BOLT project.
  • Keywords
    interactive systems; neural nets; speech recognition; ASR errors; Levenshtein alignment; automatic speech recognition system; clarification dialogs; dialog system; error correction; interactive transcript correction; neural network; phonetic matching; reranked aligners; speech-to-speech translation system; word embedding distance; Accuracy; Corporate acquisitions; Fasteners; Merging; Semantics; Speech; Transducers; ASR error detection; Dialog systems; Error correction; Levenshtein alignment; Reranking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853575
  • Filename
    6853575