Title :
Determination of human vocal-tract dynamic geometry from formant trajectories using spatial and temporal Fourier analysis
Author :
Yehia, H. ; Itakura, Fumitada
Author_Institution :
Sch. of Eng., Nagoya Univ., Japan
Abstract :
This article presents a method of estimation of the vocal-tract cross-sectional area, considered as a function of time and position along the tract length. The estimation is based on the speech formant frequencies, and uses a priori information about natural tract configurations. In general lines, the method is as follows. First, the cross-sectional area is represented by a two-dimensional Fourier cosine series expansion in time and space. Then, the locally linear relationship between spatial Fourier coefficients and formant frequencies is explored to formulate an acoustical constraint in the coefficient space. Finally, the sequence of vocal-tract areas corresponding to a given sequence of formants is estimated under positional, dynamical, and acoustical constraints. The system behavior is shown first for the static case of vowels and, then, for the dynamic case of vowel-to-vowel transitions. The method can be used as a bridge between articulatory parameter models and the speech parameter space. Moreover, it is potentially useful for area driven coders and synthesizers
Keywords :
Fourier analysis; speech coding; speech processing; speech synthesis; acoustical constraint; area driven coders; area driven synthesizers; articulatory parameter models; coefficient space; dynamical constraints; formant trajectories; human vocal-tract dynamic geometry; natural tract configurations; positional constraints; spatial Fourier analysis; spatial Fourier coefficients; speech acoustical parameters; speech formant frequencies; speech parameter space; temporal Fourier analysis; two-dimensional Fourier cosine series expansion; vocal tract length; vocal-tract cross-sectional area; vowel-to-vowel transitions; vowels; Art; Costs; Data mining; Humans; Information geometry; Speech processing; Speech synthesis; Synthesizers; Time factors;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-1775-0
DOI :
10.1109/ICASSP.1994.389252