مرکز منطقه ای اطلاع رساني علوم و فناوري - Determination of human vocal-tract dynamic geometry from formant trajectories using spatial and temporal Fourier analysis

DocumentCode :

290032

Title :

Determination of human vocal-tract dynamic geometry from formant trajectories using spatial and temporal Fourier analysis

Author :

Yehia, H. ; Itakura, Fumitada

Author_Institution :

Sch. of Eng., Nagoya Univ., Japan

Volume :

fYear :

1994

fDate :

19-22 Apr 1994

Abstract :

This article presents a method of estimation of the vocal-tract cross-sectional area, considered as a function of time and position along the tract length. The estimation is based on the speech formant frequencies, and uses a priori information about natural tract configurations. In general lines, the method is as follows. First, the cross-sectional area is represented by a two-dimensional Fourier cosine series expansion in time and space. Then, the locally linear relationship between spatial Fourier coefficients and formant frequencies is explored to formulate an acoustical constraint in the coefficient space. Finally, the sequence of vocal-tract areas corresponding to a given sequence of formants is estimated under positional, dynamical, and acoustical constraints. The system behavior is shown first for the static case of vowels and, then, for the dynamic case of vowel-to-vowel transitions. The method can be used as a bridge between articulatory parameter models and the speech parameter space. Moreover, it is potentially useful for area driven coders and synthesizers

Keywords :

Fourier analysis; speech coding; speech processing; speech synthesis; acoustical constraint; area driven coders; area driven synthesizers; articulatory parameter models; coefficient space; dynamical constraints; formant trajectories; human vocal-tract dynamic geometry; natural tract configurations; positional constraints; spatial Fourier analysis; spatial Fourier coefficients; speech acoustical parameters; speech formant frequencies; speech parameter space; temporal Fourier analysis; two-dimensional Fourier cosine series expansion; vocal tract length; vocal-tract cross-sectional area; vowel-to-vowel transitions; vowels; Art; Costs; Data mining; Humans; Information geometry; Speech processing; Speech synthesis; Synthesizers; Time factors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on

Conference_Location :

Adelaide, SA

ISSN :

1520-6149

Print_ISBN :

0-7803-1775-0

Type :

conf

DOI :

10.1109/ICASSP.1994.389252

Filename :

389252

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=290032