• DocumentCode
    1392605
  • Title

    Discriminative Optimization of the Figure of Merit for Phonetic Spoken Term Detection

  • Author

    Wallace, Roy ; Baker, Brendan ; Vogt, Robbie ; Sridharan, Sridha

  • Author_Institution
    Speech & Audio Res. Lab., Queensland Univ. of Technol., Brisbane, QLD, Australia
  • Volume
    19
  • Issue
    6
  • fYear
    2011
  • Firstpage
    1677
  • Lastpage
    1687
  • Abstract
    This paper proposes to improve spoken term detection (STD) accuracy by optimizing the figure of merit (FOM). In this paper, the index takes the form of a phonetic posterior-feature matrix. Accuracy is improved by formulating STD as a discriminative training problem and directly optimizing the FOM, through its use as an objective function to train a transformation of the index. The outcome of indexing is then a matrix of enhanced posterior-features that are directly tailored for the STD task. The technique is shown to improve the FOM by up to 13% on held-out data. Additional analysis explores the effect of the technique on phone recognition accuracy, examines the actual values of the learned transform, and demonstrates that using an extended training data set results in further improvement in the FOM.
  • Keywords
    optimisation; speech processing; discriminative optimization; discriminative training problem; figure of merit; objective function; phone recognition accuracy; phonetic posterior-feature matrix; phonetic spoken term detection; Accuracy; Hidden Markov models; Indexing; Measurement; Speech; Training; Information retrieval; speech processing; speech recognition; spoken term detection;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2010.2096215
  • Filename
    5654582