• DocumentCode
    1445842
  • Title

    Confidence measures for large vocabulary continuous speech recognition

  • Author

    Wessel, Frank ; Schlüter, Ralf ; Macherey, Klaus ; Ney, Hermann

  • Author_Institution
    Lehrstuhl fur Inf. VI, Tech. Hochschule Aachen, Germany
  • Volume
    9
  • Issue
    3
  • fYear
    2001
  • fDate
    3/1/2001 12:00:00 AM
  • Firstpage
    288
  • Lastpage
    298
  • Abstract
    In this paper, we present several confidence measures for large vocabulary continuous speech recognition. We propose to estimate the confidence of a hypothesized word directly as its posterior probability, given all acoustic observations of the utterance. These probabilities are computed on word graphs using a forward-backward algorithm. We also study the estimation of posterior probabilities on N-best lists instead of word graphs and compare both algorithms in detail. In addition, we compare the posterior probabilities with two alternative confidence measures, i.e., the acoustic stability and the hypothesis density. We present experimental results on five different corpora: the Dutch ARISE 1k evaluation corpus, the German Verbmobil ´98 7k evaluation corpus, the English North American Business ´94 20k and 64k development corpora, and the English Broadcast News ´96 65k evaluation corpus. We show that the posterior probabilities computed on word graphs outperform all other confidence measures. The relative reduction in confidence error rate ranges between 19% and 35% compared to the baseline confidence error rate
  • Keywords
    error statistics; graph theory; speech recognition; Dutch ARISE 1k evaluation corpus; English Broadcast News ´96 65k evaluation corpus; English North American Business ´94 20k corpus; English North American Business ´94 64k corpus; German Verbmobil ´98 7k evaluation corpus; N-best lists; acoustic stability; confidence measures; error rate; forward-backward algorithm; hypothesis density; hypothesized word; large vocabulary continuous speech recognition; posterior probability; utterance; word graphs; Acoustic measurements; Broadcasting; Density measurement; Error analysis; Error correction; Position measurement; Probability; Speech recognition; Stability; Vocabulary;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.906002
  • Filename
    906002