DocumentCode :
284603
Title :
Application of a generalized probabilistic descent method to dynamic time warping-based speech recognition
Author :
Komori, Takshi ; Katagiri, Shigeru
Author_Institution :
ATR Auditory & Visual Perception Res. Lab., Kyoto, Japan
Volume :
1
fYear :
1992
fDate :
23-26 Mar 1992
Firstpage :
497
Abstract :
Although several kinds of discriminative training methods based on artificial neural networks have been vigorously tested, the pursuit of highly capable classification of variable-duration speech patterns has been unsatisfactory. In this light, the authors evaluate a generalized probabilistic descent method (GPD) in designing a speech recognizer incorporated with the dynamic time warping methodology. The algorithm can be viewed as generalized learning vector quantization suited to the dynamic programming-based time warping. Experiments were conducted on two tasks: English syllable classification and Japanese phoneme classification. Results clearly demonstrate that GPD can be a viable candidate for a method to realize a high-performance speech recognizer
Keywords :
dynamic programming; learning (artificial intelligence); neural nets; speech recognition; vector quantisation; English syllable classification; GPD method; Japanese phoneme classification; artificial neural networks; discriminative training; dynamic programming-based time warping; dynamic time warping-based speech recognition; generalised learning VQ; generalized learning vector quantization; generalized probabilistic descent; high-performance speech recognizer; variable-duration speech patterns; Convergence; Cost function; Euclidean distance; Optimization methods; Particle measurements; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
ISSN :
1520-6149
Print_ISBN :
0-7803-0532-9
Type :
conf
DOI :
10.1109/ICASSP.1992.225863
Filename :
225863
Link To Document :
بازگشت