• DocumentCode
    3531340
  • Title

    Posterior-based confidence measures for spoken term detection

  • Author

    Wang, Dong ; Tejedor, Javier ; Frankel, Joe ; King, Simon ; Colás, Jose

  • Author_Institution
    Centre for Speech Technol. Res., Univ. of Edinburgh, Edinburgh
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4889
  • Lastpage
    4892
  • Abstract
    Confidence measures play a key role in spoken term detection (STD) tasks. The confidence measure expresses the posterior probability of the search term appearing in the detection period, given the speech. Traditional approaches are based on the acoustic and language model scores for candidate detections found using automatic speech recognition, with Bayes´ rule being used to compute the desired posterior probability. In this paper, we present a novel direct posterior-based confidence measure which, instead of resorting to the Bayesian formula, calculates posterior probabilities from a multi-layer perceptron (MLP) directly. Compared with traditional Bayesian-based methods, the direct-posterior approach is conceptually and mathematically simpler. Moreover, the MLP-based model does not require assumptions to be made about the acoustic features such as their statistical distribution and the independence of static and dynamic co-efficients. Our experimental results in both English and Spanish demonstrate that the proposed direct posterior-based confidence improves STD performance.
  • Keywords
    Bayes methods; multilayer perceptrons; speech recognition; statistical distributions; Bayesian-based methods; acoustic model; automatic speech recognition; language model; multilayer perceptron; posterior probability; posterior-based confidence measures; spoken term detection; statistical distribution; Acoustic measurements; Acoustic signal detection; Bayesian methods; Hidden Markov models; Humans; Laboratories; Lattices; Natural languages; Probability; Speech recognition; MLP; Spoken term detection; confidence measure; posterior probabilities;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960727
  • Filename
    4960727