DocumentCode :
149613
Title :
Audiovisual to area and length functions inversion of human vocal tract
Author :
Elie, Benjamin ; Laprie, Yves
Author_Institution :
LORIA, Univ. de Lorraine, Nancy, France
fYear :
2014
fDate :
1-5 Sept. 2014
Firstpage :
2300
Lastpage :
2304
Abstract :
This paper proposes a multimodal approach for estimating the area function and the length of the vocal tract for oral vowels. The method is based on an iterative technique that deforms an initial area function until the output acoustic vector matches a specified target. The chosen acoustic vector is the formant frequency pattern. To regularize this ill-posed problem, several constraints are added to the algorithm. First, the lip termination area is estimated using facial capture software. Then, the area function is constrained so that it does not stray too far from a neutral position and, in dynamic inversion, does not change too quickly from one temporal frame to the next. The method proves effective at approximating the area function and the length of the vocal tract for oral French vowels, in both static and dynamic configurations.
Keywords :
acoustic signal processing; audio-visual systems; iterative methods; speech processing; acoustic vector; area function inversion; audiovisual inversion; dynamic configurations; facial capture software; formant frequency pattern; human vocal tract; iterative technique; length function inversion; lip termination area; multimodal approach; oral French vowels; static configurations; Acoustics; Apertures; Estimation; Frequency measurement; Sensitivity; Speech; Vectors; Audiovisual inversion; Dynamic inversion; Regularization; Vocal tract length
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
2014 Proceedings of the 22nd European Signal Processing Conference (EUSIPCO)
Conference_Location :
Lisbon
Type :
conf
Filename :
6952840