DocumentCode :
2020991
Title :
Prototype-based MCE/GPD training for word spotting and connected word recognition
Author :
McDermott, Erik ; Katagiri, Shigeru
Author_Institution :
ATR Auditory & Visual Perception Res. Lab., Soraku-gun, Kyoto, Japan
Volume :
2
fYear :
1993
fDate :
27-30 April 1993
Firstpage :
291
Abstract :
A straightforward application of PBMEC (prototype-based minimum error classifier) training to existing techniques for handling continuous speech is described. A novel MCE/GPD (minimum classification error/generalized probabilistic descent) loss function that can incorporate word spotting errors and other measures of symbolic distance between correct and incorrect categories is defined. Classification consists in a time-synchronous DTW (dynamic time warping) pass through a finite state machine; adaptation makes use of an A* based N-best algorithm and consists in propagating the derivative of the loss over the N best paths through the finite state machine. The key feature is that the loss function being optimized closely reflects the actual recognition performance of the system.<>
Keywords :
errors; finite state machines; learning (artificial intelligence); neural nets; speech recognition; N-best algorithm; connected word recognition; dynamic time warping; errors; finite state machine; generalized probabilistic descent; loss function; prototype-based minimum error classifier; recognition performance; training; word spotting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location :
Minneapolis, MN, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.1993.319293
Filename :
319293
Link To Document :
بازگشت