Title :
Modeling the Vocal Tract Transfer Function Using a 3D Digital Waveguide Mesh
Author :
Speed, Matt ; Murphy, Damian ; Howard, David
Author_Institution :
Dept. of Electron., Univ. of York, Heslington, UK
Abstract :
The digital waveguide mesh has been shown to be capable of reproducing the acoustic impulse response of cylindrical vocal tract analogs. This study extends the same methodology to three-dimensional simulation of the acoustic response of graphical models of the vocal tract obtained from magnetic resonance imaging for a group of trained subjects. By such simulation of the vocal tract transfer function and convolution with an appropriate source waveform, basic phonemes are resynthesized and compared with benchmark audio recordings. The technologies and techniques used for simulation are described, alongside the protocol for image capture and the process for collection of benchmark audio. The results of simulation and acoustic recording are then evaluated and compared. The value of three-dimensional simulation in comparison to existing lower-dimensionality equivalents is assessed. It is found that while three-dimensional simulation provides a strong representation of the low frequency vocal tract transfer function, at higher frequencies its performance becomes geometry-dependent. MRI imaging and benchmark audio is provided for future studies and to permit comparison with comparable means of acoustic simulation.
Keywords :
acoustic signal processing; transfer functions; 3D digital waveguide mesh; MRI imaging; acoustic impulse response; acoustic recording; basic phonemes; benchmark audio collection; benchmark audio recordings; cylindrical vocal tract analogs; graphical models; image capture; low-frequency vocal tract transfer function representation; lower-dimensionality equivalents; magnetic resonance imaging; source waveform; three-dimensional simulation; vocal tract transfer function modeling; vocal tract transfer function simulation; Acoustic waveguides; Magnetic resonance imaging; Numerical models; Solid modeling; Transfer functions; Signal processing; acoustic signal processing; human voice; speech synthesis;
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
DOI :
10.1109/TASLP.2013.2294579