مرکز منطقه ای اطلاع رساني علوم و فناوري - Discriminative Optimization of the Figure of Merit for Phonetic Spoken Term Detection

DocumentCode :

1392605

Title :

Discriminative Optimization of the Figure of Merit for Phonetic Spoken Term Detection

Author :

Wallace, Roy ; Baker, Brendan ; Vogt, Robbie ; Sridharan, Sridha

Author_Institution :

Speech & Audio Res. Lab., Queensland Univ. of Technol., Brisbane, QLD, Australia

Volume :

Issue :

fYear :

2011

Firstpage :

1677

Lastpage :

1687

Abstract :

This paper proposes to improve spoken term detection (STD) accuracy by optimizing the figure of merit (FOM). In this paper, the index takes the form of a phonetic posterior-feature matrix. Accuracy is improved by formulating STD as a discriminative training problem and directly optimizing the FOM, through its use as an objective function to train a transformation of the index. The outcome of indexing is then a matrix of enhanced posterior-features that are directly tailored for the STD task. The technique is shown to improve the FOM by up to 13% on held-out data. Additional analysis explores the effect of the technique on phone recognition accuracy, examines the actual values of the learned transform, and demonstrates that using an extended training data set results in further improvement in the FOM.

Keywords :

optimisation; speech processing; discriminative optimization; discriminative training problem; figure of merit; objective function; phone recognition accuracy; phonetic posterior-feature matrix; phonetic spoken term detection; Accuracy; Hidden Markov models; Indexing; Measurement; Speech; Training; Information retrieval; speech processing; speech recognition; spoken term detection;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2010.2096215

Filename :

5654582

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1392605