• DocumentCode
    1712553
  • Title

    Confidence measures for detecting speech recognition errors

  • Author

    Gada, Jigar ; Rao, Preeti ; Samudravijaya, K

  • Author_Institution
    Department of Electrical Engineering, Indian Institute of Technology Bombay, Mumbai 400076, India
  • fYear
    2013
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Errors of speech recognition systems occur due to a variety of reasons. It is desirable to have a confidence measure that gives an idea of the accuracy of the decoder output, so that appropriate remedial measures can be taken. In this paper, we compare two approaches to detect incorrect output of a speech recognition system. The first approach employs multiple decoders, and uses a voting method to surmise confidence in the accuracy of the speech recognition system. The second approach uses a single decoder, but judiciously combines information at the segmental as well as supra segmental level to derive a measure of confidence in the output of the decoder. A neural network is trained with three features based on phone duration and one feature based on acoustic score. The output of the neural network is used to estimate the confidence in the output of the decoder. The two approaches are compared for their efficacy in detecting utterances that do not contain a valid input according to the task grammar as well as wrongly recognized valid inputs. It was observed that the second method achieves much better rejection of invalid input utterances as compared to the multi-decoder method, despite decoding a test utterance just once.
  • Keywords
    Accuracy; Acoustics; Artificial neural networks; Decoding; Noise measurement; Speech; Speech recognition; Automatic Speech Recognition; CMU-Sphinx; Confidence Measures; Out of Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications (NCC), 2013 National Conference on
  • Conference_Location
    New Delhi, India
  • Print_ISBN
    978-1-4673-5950-4
  • Electronic_ISBN
    978-1-4673-5951-1
  • Type

    conf

  • DOI
    10.1109/NCC.2013.6487991
  • Filename
    6487991