Title :
Phonetic Distance Based Confidence Measure
Author :
Kim, Wooil ; Hansen, John H L
Author_Institution :
Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
Abstract :
This letter presents a novel confidence measure for the purpose of improving user performance in Spoken Document Retrieval (SDR). The proposed confidence measure is based on the phonetic distance between subword models, employing an anti-model which is determined to be discriminative to a target model using offline training data. As an advancement from our previous work, the proposed method employs separate phonetic similarity knowledge for vowels and consonants, resulting in more reliable performance over diverse SDR recorded speech conditions. A transcript reliability estimator is also presented, with evaluation as an application of the proposed confidence measure. Analysis on a variety of corpora including background noise, frequency band-restrictions, and a range of real-life conditions, shows that the proposed confidence measure is more reliable in detecting corrupted speech due to acoustic conditions or an unarticulated speaking style, providing a higher correlation to word error rate (WER). The proposed confidence measure is effective in increasing transcript reliability estimation performance with a 16.21% relative improvement.
Keywords :
document handling; speech processing; anti-model; confidence measure; corrupted speech detection; offline training data; phonetic distance; phonetic similarity knowledge; spoken document retrieval; subword model; transcript reliability estimation performance; transcript reliability estimator; unarticulated speaking style; user performance; word error rate; Anti-model; confidence measure; phonetic distance; reliability estimation; spoken document retrieval;
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2009.2034551