Title :
A parametric three-dimensional model of the vocal-tract based on MRI data
Author :
Yehia, Hani ; Tiede, Mark
Author_Institution :
ATR Human Inf. Res. Labs., Kyoto, Japan
Abstract :
Twenty-four three-dimensional (3D) vocal-tract (VT) shapes extracted from MRI data are used to derive a parametric model of the vocal tract. The method is as follows. First, each 3D VT shape is sampled using a semi-cylindrical grid whose position is determined by reference points based on the VT anatomy. Next, the VT projections onto each plane of the grid are represented by their two main components, obtained via principal component analysis (PCA). PCA is then applied a second time to parametrize the sequences of coefficients that represent the sections along the tract. The first four components were found to explain about 90% of the total variance of the observed shapes. Following this procedure, 3D VT shapes are approximated by linear combinations of four 3D basis functions. Finally, it is shown that the four parameters of the model can be estimated from the VT midsagittal profiles.
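The two-stage PCA described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it is one possible reading of the abstract, in which each grid section is reduced to two coefficients by a per-section PCA across shapes, and the concatenated coefficients are then reduced to four parameters by a second PCA. The array sizes (30 sections, 20 samples per section), the synthetic data, and the function names pca, fit_two_stage and reconstruct are all assumptions made for illustration; no MRI data is used.

```python
import numpy as np


def pca(X, k):
    """PCA of X (samples x features) via SVD; keeps the first k components."""
    mean = X.mean(axis=0)
    Xc = X - mean
    _, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    basis = Vt[:k]                      # k x features
    scores = Xc @ basis.T               # samples x k
    explained = (s[:k] ** 2).sum() / (s ** 2).sum()
    return mean, basis, scores, explained


def fit_two_stage(shapes, k_section=2, k_global=4):
    """shapes: (n_shapes, n_sections, n_points) array of sampled sections."""
    n_shapes, n_sections, _ = shapes.shape
    section_models = []
    coeffs = np.empty((n_shapes, n_sections, k_section))
    # Stage 1 (assumed reading): one PCA per grid section, two coefficients each.
    for j in range(n_sections):
        mean, basis, scores, _ = pca(shapes[:, j, :], k_section)
        section_models.append((mean, basis))
        coeffs[:, j, :] = scores
    # Stage 2: PCA on the sequence of section coefficients along the tract.
    flat = coeffs.reshape(n_shapes, -1)
    g_mean, g_basis, params, explained = pca(flat, k_global)
    return section_models, (g_mean, g_basis), params, explained


def reconstruct(section_models, global_model, params):
    """Map model parameters back to approximate sampled shapes (linear)."""
    g_mean, g_basis = global_model
    flat = g_mean + params @ g_basis
    coeffs = flat.reshape(params.shape[0], len(section_models), -1)
    sections = [coeffs[:, j, :] @ basis + mean
                for j, (mean, basis) in enumerate(section_models)]
    return np.stack(sections, axis=1)


# Synthetic stand-in data (assumption): 24 shapes, 30 sections, 20 samples each,
# generated from 4 latent factors plus a small amount of noise.
rng = np.random.default_rng(0)
latent = rng.normal(size=(24, 4))
mixing = rng.normal(size=(4, 30 * 20))
shapes = (latent @ mixing).reshape(24, 30, 20)
shapes = shapes + 0.01 * rng.normal(size=shapes.shape)

section_models, global_model, params, explained = fit_two_stage(shapes)
approx = reconstruct(section_models, global_model, params)
```

Here params holds four numbers per shape, and approx is the corresponding linear reconstruction. Note that explained in this sketch is the fraction of variance of the stage-one coefficients captured by the four global components, which is not necessarily the same quantity as the roughly 90% figure quoted in the abstract.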
Keywords :
biomedical NMR; parameter estimation; physiological models; signal sampling; speech processing; 3D basis functions; 3D vocal tract shapes; MRI data; PCA; articulatory speech processes; coefficients; observed shape variance; parametric model; parametric three-dimensional model; principal component analysis; reference points; sampling; semicylindrical grid; vocal tract midsagittal profiles; Anatomy; Data mining; Humans; Laboratories; Light rail systems; Magnetic resonance imaging; Principal component analysis; Shape; Speech processing; Speech synthesis;
Conference_Title :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.598809