Nonlinear dynamical system based acoustic modeling for ASR

Author

Warakagoda, Narada D. ; Johnsen, Magne H.

Author_Institution

Dept. of Telecommun., NTNU, Trondheim, Norway

Volume

1

fYear

2001

fDate

2001

Firstpage

525

Abstract

The work presented is centered around a speech production model called the chained dynamical system model (CDSM) which is motivated by the fundamental limitations of the mainstream ASR approaches. The CDSM is essentially a smoothly time varying continuous state nonlinear dynamical system, consisting of two sub dynamical systems coupled as a chain so that one system controls the parameters of the next system. The speech recognition problem is posed as inverting the CDSM, for which we propose a solution based on the theory of embedding. The resulting architecture, which we call inverted CDSM (ICDSM) is evaluated in a set of experiments involving a speaker independent, continuous speech recognition task on the TIMIT database. Results of these experiments which can be compared with the corresponding results in the literature, confirm the feasibility and advantages of the approach

Keywords

hidden Markov models; parameter estimation; pattern classification; speech recognition; TIMIT database; acoustic modeling; automatic speech recognition; chained dynamical system model; embedding theory; smoothly time varying continuous state nonlinear dynamical system; speaker independent continuous speech recognition; statistical pattern recognition; Automatic speech recognition; Control systems; Couplings; Databases; Nonlinear acoustics; Nonlinear control systems; Nonlinear dynamical systems; Production systems; Speech recognition; Time varying systems;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on

Conference_Location

Salt Lake City, UT

ISSN

1520-6149

Print_ISBN

0-7803-7041-4

Type

conf

DOI

10.1109/ICASSP.2001.940883

Filename

940883