DocumentCode :
302336
Title :
A phone-dependent confidence measure for utterance rejection
Author :
Rivlin, Ze´ev ; Cohen, Michael ; Abrash, Victor ; Chung, Thomas
Author_Institution :
Speech Technol. & Res. Lab., SRI Int., Menlo Park, CA, USA
Volume :
1
fYear :
1996
fDate :
7-10 May 1996
Firstpage :
515
Abstract :
An acoustic confidence measure for acceptance/rejection of recognition hypotheses for continuous speech utterances is proposed. This measure is useful for rejecting utterances that are out of domain, or contain out-of-vocabulary words or speech disfluencies. A phone-based approach is implemented so that a single global threshold can be applied to hypothesis rejection for any word sequence. Phone confidence is computed for each frame of speech as the posterior phone probability given the acoustic observation. Word sequence confidence is evaluated as the average phone confidence, either by weighting all frames equally or by normalizing by phone duration. The confidence measure is tested on a database of spoken company names. When normalized by phone duration, it achieves, in some cases with less computational expense, rejection performance comparable to a baseline system implementing a common filler-model approach. When all frames are equally weighted, performance is substantially poorer
Keywords :
acoustic signal processing; probability; speech processing; speech recognition; acoustic confidence measure; acoustic observation; average phone confidence; baseline system; continuous speech utterances; equally weighted frames; filler model; global threshold; hypothesis rejection; out of vocabulary words; phone dependent confidence measure; phone duration; posterior phone probability; recognition hypotheses; rejection performance; speech disfluencies; speech frame; spoken company names database; utterance acceptance; utterance rejection; word sequence; word sequence confidence; Acoustic measurements; Acoustic testing; Context modeling; Databases; Equations; Hidden Markov models; Laboratories; Natural languages; Speech recognition; Telephony;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
ISSN :
1520-6149
Print_ISBN :
0-7803-3192-3
Type :
conf
DOI :
10.1109/ICASSP.1996.541146
Filename :
541146
Link To Document :
بازگشت