DocumentCode
284603
Title
Application of a generalized probabilistic descent method to dynamic time warping-based speech recognition
Author
Komori, Takshi ; Katagiri, Shigeru
Author_Institution
ATR Auditory & Visual Perception Res. Lab., Kyoto, Japan
Volume
1
fYear
1992
fDate
23-26 Mar 1992
Firstpage
497
Abstract
Although several kinds of discriminative training methods based on artificial neural networks have been vigorously tested, the pursuit of highly capable classification of variable-duration speech patterns has been unsatisfactory. In this light, the authors evaluate a generalized probabilistic descent method (GPD) in designing a speech recognizer incorporated with the dynamic time warping methodology. The algorithm can be viewed as generalized learning vector quantization suited to the dynamic programming-based time warping. Experiments were conducted on two tasks: English syllable classification and Japanese phoneme classification. Results clearly demonstrate that GPD can be a viable candidate for a method to realize a high-performance speech recognizer
Keywords
dynamic programming; learning (artificial intelligence); neural nets; speech recognition; vector quantisation; English syllable classification; GPD method; Japanese phoneme classification; artificial neural networks; discriminative training; dynamic programming-based time warping; dynamic time warping-based speech recognition; generalised learning VQ; generalized learning vector quantization; generalized probabilistic descent; high-performance speech recognizer; variable-duration speech patterns; Convergence; Cost function; Euclidean distance; Optimization methods; Particle measurements; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location
San Francisco, CA
ISSN
1520-6149
Print_ISBN
0-7803-0532-9
Type
conf
DOI
10.1109/ICASSP.1992.225863
Filename
225863
Link To Document