• DocumentCode
    1493129
  • Title

    Calibration of Confidence Measures in Speech Recognition

  • Author

    Yu, Dong ; Li, Jinyu ; Deng, Li

  • Author_Institution
    Microsoft Res., Redmond, WA, USA
  • Volume
    19
  • Issue
    8
  • fYear
    2011
  • Firstpage
    2461
  • Lastpage
    2473
  • Abstract
    Most speech recognition applications in use today rely heavily on confidence measure for making optimal decisions. In this paper, we aim to answer the question: what can be done to improve the quality of confidence measure if we cannot modify the speech recognition engine? The answer provided in this paper is a post-processing step called confidence calibration, which can be viewed as a special adaptation technique applied to confidence measure. Three confidence calibration methods have been developed in this work: the maximum entropy model with distribution constraints, the artificial neural network, and the deep belief network. We compare these approaches and demonstrate the importance of key features exploited: the generic confidence-score, the application-dependent word distribution, and the rule coverage ratio. We demonstrate the effectiveness of confidence calibration on a variety of tasks with significant normalized cross entropy increase and equal error rate reduction.
  • Keywords
    belief networks; decision making; maximum entropy methods; neural nets; speech recognition; application-dependent word distribution; artificial neural network; confidence calibration; deep belief network; generic confidence score; maximum entropy model; optimal decision making; rule coverage ratio; speech recognition; Artificial neural networks; Calibration; Engines; Entropy; Semantics; Speech; Speech recognition; Confidence calibration; confidence measure; deep belief network; distribution constraint; maximum entropy; word distribution;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2011.2141988
  • Filename
    5749278