DocumentCode :
3531340
Title :
Posterior-based confidence measures for spoken term detection
Author :
Wang, Dong ; Tejedor, Javier ; Frankel, Joe ; King, Simon ; Colás, Jose
Author_Institution :
Centre for Speech Technol. Res., Univ. of Edinburgh, Edinburgh
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4889
Lastpage :
4892
Abstract :
Confidence measures play a key role in spoken term detection (STD) tasks. The confidence measure expresses the posterior probability of the search term appearing in the detection period, given the speech. Traditional approaches are based on the acoustic and language model scores for candidate detections found using automatic speech recognition, with Bayes´ rule being used to compute the desired posterior probability. In this paper, we present a novel direct posterior-based confidence measure which, instead of resorting to the Bayesian formula, calculates posterior probabilities from a multi-layer perceptron (MLP) directly. Compared with traditional Bayesian-based methods, the direct-posterior approach is conceptually and mathematically simpler. Moreover, the MLP-based model does not require assumptions to be made about the acoustic features such as their statistical distribution and the independence of static and dynamic co-efficients. Our experimental results in both English and Spanish demonstrate that the proposed direct posterior-based confidence improves STD performance.
Keywords :
Bayes methods; multilayer perceptrons; speech recognition; statistical distributions; Bayesian-based methods; acoustic model; automatic speech recognition; language model; multilayer perceptron; posterior probability; posterior-based confidence measures; spoken term detection; statistical distribution; Acoustic measurements; Acoustic signal detection; Bayesian methods; Hidden Markov models; Humans; Laboratories; Lattices; Natural languages; Probability; Speech recognition; MLP; Spoken term detection; confidence measure; posterior probabilities;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960727
Filename :
4960727
Link To Document :
بازگشت