Title :
Discriminative articulatory models for spoken term detection in low-resource conversational settings
Author :
Prabhavalkar, Rohit ; Livescu, Karen ; Fosler-Lussier, Eric ; Keshet, Joseph
Author_Institution :
Ohio State Univ., Columbus, OH, USA
Abstract :
We study spoken term detection (STD) - the task of determining whether and where a given word or phrase appears in a given segment of speech - using articulatory feature-based pronunciation models. The models are motivated by the requirements of STD in low-resource settings, in which it may not be feasible to train a large-vocabulary continuous speech recognition system, as well as by the need to address pronunciation variation in conversational speech. Our STD system is trained to maximize the expected area under the receiver operating characteristic curve, often used to evaluate STD performance. In experimental evaluations on the Switchboard corpus, we find that our approach outperforms a baseline HMM-based system across a number of training set sizes, as well as a discriminative phone-based model in some settings.
Keywords :
learning (artificial intelligence); speech recognition; HMM-based system; STD system; Switchboard corpus; articulatory feature-based pronunciation models; discriminative articulatory models; discriminative phone-based model; large-vocabulary continuous speech recognition system; low-resource conversational settings; spoken term detection; Acoustics; Context modeling; Hidden Markov models; Speech; Speech recognition; Switches; Training; AUC; articulatory features; discriminative training; spoken term detection; structural SVM;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639281