DocumentCode :
290032
Title :
Determination of human vocal-tract dynamic geometry from formant trajectories using spatial and temporal Fourier analysis
Author :
Yehia, H.; Itakura, Fumitada
Author_Institution :
Sch. of Eng., Nagoya Univ., Japan
Volume :
i
fYear :
1994
fDate :
19-22 Apr 1994
Abstract :
This article presents a method for estimating the vocal-tract cross-sectional area, considered as a function of time and of position along the tract length. The estimation is based on the speech formant frequencies and uses a priori information about natural tract configurations. In outline, the method is as follows. First, the cross-sectional area is represented by a two-dimensional Fourier cosine series expansion in time and space. Then, the locally linear relationship between spatial Fourier coefficients and formant frequencies is exploited to formulate an acoustical constraint in the coefficient space. Finally, the sequence of vocal-tract areas corresponding to a given sequence of formants is estimated under positional, dynamical, and acoustical constraints. The system behavior is shown first for the static case of vowels and then for the dynamic case of vowel-to-vowel transitions. The method can be used as a bridge between articulatory parameter models and the speech parameter space. Moreover, it is potentially useful for area-driven coders and synthesizers.
Keywords :
Fourier analysis; speech coding; speech processing; speech synthesis; acoustical constraint; area driven coders; area driven synthesizers; articulatory parameter models; coefficient space; dynamical constraints; formant trajectories; human vocal-tract dynamic geometry; natural tract configurations; positional constraints; spatial Fourier analysis; spatial Fourier coefficients; speech acoustical parameters; speech formant frequencies; speech parameter space; temporal Fourier analysis; two-dimensional Fourier cosine series expansion; vocal tract length; vocal-tract cross-sectional area; vowel-to-vowel transitions; vowels; Art; Costs; Data mining; Humans; Information geometry; Speech processing; Speech synthesis; Synthesizers; Time factors
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
ISSN :
1520-6149
Print_ISBN :
0-7803-1775-0
Type :
conf
DOI :
10.1109/ICASSP.1994.389252
Filename :
389252