Title :
Nonlinear dynamical system based acoustic modeling for ASR
Author :
Warakagoda, Narada D. ; Johnsen, Magne H.
Author_Institution :
Dept. of Telecommun., NTNU, Trondheim, Norway
Abstract :
The work presented is centered around a speech production model called the chained dynamical system model (CDSM) which is motivated by the fundamental limitations of the mainstream ASR approaches. The CDSM is essentially a smoothly time varying continuous state nonlinear dynamical system, consisting of two sub dynamical systems coupled as a chain so that one system controls the parameters of the next system. The speech recognition problem is posed as inverting the CDSM, for which we propose a solution based on the theory of embedding. The resulting architecture, which we call inverted CDSM (ICDSM) is evaluated in a set of experiments involving a speaker independent, continuous speech recognition task on the TIMIT database. Results of these experiments which can be compared with the corresponding results in the literature, confirm the feasibility and advantages of the approach
Keywords :
hidden Markov models; parameter estimation; pattern classification; speech recognition; TIMIT database; acoustic modeling; automatic speech recognition; chained dynamical system model; embedding theory; smoothly time varying continuous state nonlinear dynamical system; speaker independent continuous speech recognition; statistical pattern recognition; Automatic speech recognition; Control systems; Couplings; Databases; Nonlinear acoustics; Nonlinear control systems; Nonlinear dynamical systems; Production systems; Speech recognition; Time varying systems;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.940883