Title :
Creating 3D speech-driven talking heads: a probabilistic network approach
Author :
Choi, Kyoungho ; Hwang, Jenq-Neng
Author_Institution :
Dept. of Electr. Eng., Univ. of Washington, Seattle, WA, USA
Abstract :
We present a probabilistic approach to decide whether or not extracted facial features are appropriate for creating 3D face models. Automatically extracted 2D facial features from a video sequence are fed into the proposed probabilistic framework before a corresponding 3D face model is built to avoid generating unnatural or non-realistic 3D faces. In addition, a new algorithm for audio-to-visual conversion based on constrained optimization is presented to generate visual parameters for driving the mouth movement of the 3D face models from speech. Lagrangian optimization is applied to transform a constrained problem into an unconstrained problem. Experimental results are provided to show the effectiveness and validity of the proposed algorithms for various video sequences and speech.
Keywords :
feature extraction; image sequences; optimisation; probability; solid modelling; speech processing; video signal processing; 3D face models; 3D speech-driven talking heads; Lagrangian optimization; audio-to-visual conversion; constrained optimization; facial feature extraction; mouth movement; probabilistic network approach; video sequence; Constraint optimization; Face detection; Facial features; Head; Lagrangian functions; Mouth; Real time systems; Robustness; Speech; Video sequences;
Conference_Titel :
Image Processing. 2002. Proceedings. 2002 International Conference on
Print_ISBN :
0-7803-7622-6
DOI :
10.1109/ICIP.2002.1038193