DocumentCode
1445842
Title
Confidence measures for large vocabulary continuous speech recognition
Author
Wessel, Frank ; Schlüter, Ralf ; Macherey, Klaus ; Ney, Hermann
Author_Institution
Lehrstuhl fur Inf. VI, Tech. Hochschule Aachen, Germany
Volume
9
Issue
3
fYear
2001
fDate
3/1/2001 12:00:00 AM
Firstpage
288
Lastpage
298
Abstract
In this paper, we present several confidence measures for large vocabulary continuous speech recognition. We propose to estimate the confidence of a hypothesized word directly as its posterior probability, given all acoustic observations of the utterance. These probabilities are computed on word graphs using a forward-backward algorithm. We also study the estimation of posterior probabilities on N-best lists instead of word graphs and compare both algorithms in detail. In addition, we compare the posterior probabilities with two alternative confidence measures, i.e., the acoustic stability and the hypothesis density. We present experimental results on five different corpora: the Dutch ARISE 1k evaluation corpus, the German Verbmobil ´98 7k evaluation corpus, the English North American Business ´94 20k and 64k development corpora, and the English Broadcast News ´96 65k evaluation corpus. We show that the posterior probabilities computed on word graphs outperform all other confidence measures. The relative reduction in confidence error rate ranges between 19% and 35% compared to the baseline confidence error rate
Keywords
error statistics; graph theory; speech recognition; Dutch ARISE 1k evaluation corpus; English Broadcast News ´96 65k evaluation corpus; English North American Business ´94 20k corpus; English North American Business ´94 64k corpus; German Verbmobil ´98 7k evaluation corpus; N-best lists; acoustic stability; confidence measures; error rate; forward-backward algorithm; hypothesis density; hypothesized word; large vocabulary continuous speech recognition; posterior probability; utterance; word graphs; Acoustic measurements; Broadcasting; Density measurement; Error analysis; Error correction; Position measurement; Probability; Speech recognition; Stability; Vocabulary;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/89.906002
Filename
906002
Link To Document