• DocumentCode
    2180655
  • Title

    Training of error-corrective model for ASR without using audio data

  • Author

    Kurata, Gakuto ; Itoh, Nobuyasu ; Nishimura, Masafumi

  • Author_Institution
    IBM Res. - Tokyo, Yamato, Japan
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5576
  • Lastpage
    5579
  • Abstract
    This paper introduces a method to train an error-corrective model for Automatic Speech Recognition (ASR) without using audio data. In existing techniques, it is assumed that sufficient audio data of the tar get application is available and negative samples can be prepared by having ASR recognize this audio data. However, this assumption is not always true. We propose generating probable N-best lists, which the ASR may produce, directly from the text data of the target application by taking phoneme similarity into consideration. We call this process "Pseudo-ASR". We conduct discriminative reranking with the error-corrective model by regarding the text data as positive samples and the N-best lists from the Pseudo-ASR as negative samples. Experiments with Japanese call center data showed that discriminative reranking based on the Pseudo-ASR improved the accuracy of the ASR.
  • Keywords
    speech recognition; N-best lists; audio data; automatic speech recognition; error-corrective model; pseudo-ASR process; Indexes; Call Center; Discriminative Reranking; Error-corrective Model; Large Vocabulary Continuous Speech Recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947623
  • Filename
    5947623