Title :
A functional articulatory dynamic model for speech production
Author :
Lee, Leo J. ; Fieguth, Paul ; Deng, Li
Author_Institution :
Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada
Abstract :
Introduces a statistical speech production model. The model synthesizes natural speech by modeling some key dynamic properties of vocal articulators in a linear/nonlinear state-space framework. The goal-oriented movements of the articulators (tongue tip, tongue dorsum, upper lip, lower lip, and jaw) are described in a linear dynamic state equation. The resulting articulatory trajectories, combined with the effects of the velum and larynx, are nonlinearly mapped into the acoustic feature space (MFCCs). The key challenges in this model are the development of a nonlinear parameter estimation methodology, and the incorporation of appropriate prior assumptions to assert in the articulatory dynamic structure. Such a model can also be directly applied to speech recognition to better account for coarticulation and phonetic reduction phenomena with considerably fewer parameters than HMM based approaches
Keywords :
matrix algebra; maximum likelihood estimation; parameter estimation; speech synthesis; state-space methods; acoustic feature space; articulatory trajectories; coarticulation; dynamic properties; functional articulatory dynamic model; jaw; larynx; lower lip; natural speech; nonlinear parameter estimation methodology; phonetic reduction phenomena; speech recognition; state-space framework; statistical speech production model; tongue dorsum; tongue tip; upper lip; velum; vocal articulators; Hidden Markov models; Larynx; Natural languages; Nonlinear acoustics; Nonlinear equations; Parameter estimation; Speech recognition; Speech synthesis; Tongue; Trajectory;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.941035