DocumentCode :
1560627
Title :
Constrained optimization for a speech driven talking head
Author :
Choi, Kyoung-Ho ; Lee, Jong-Hoon
Author_Institution :
Electron. & Telecommun. Res. Inst., Comput. & Software Res. Inst., Daejeon, South Korea
Volume :
2
fYear :
2003
Abstract :
In this paper, a novel algorithm for audio-to-visual conversion based on constrained optimization is presented. Based on facial muscle analysis, the dynamics of mouth movements are modeled and constraints are obtained from them. The obtained constraints are used to estimate visual parameters from speech in a framework of HMM-based visual parameter estimation. The proposed constrained optimization approach finds visual parameters that satisfy given constraints and maximize the auxiliary functions that are used to train the audio-visual HMMs. This approach enables the algorithm to produce reliable visual parameters even in noisy environments. Experimental results demonstrate that the proposed audio-to-visual conversion method is able to follow true visual parameters robustly in various noisy environments.
Keywords :
audio-visual systems; computer animation; hidden Markov models; multimedia systems; optimisation; speech processing; speech recognition; speech-based user interfaces; synchronisation; HMM-based visual parameter estimation; animated face; audio-to-visual conversion; audio-visual HMM training; constrained optimization; facial muscle analysis; mouth movement dynamics; multimedia applications; noisy environments; speech-driven talking head; synchronization; talking head systems; virtual face; Constraint optimization; Degradation; Facial animation; Facial muscles; Hidden Markov models; Mouth; Parameter estimation; Robustness; Speech; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits and Systems, 2003. ISCAS '03. Proceedings of the 2003 International Symposium on
Print_ISBN :
0-7803-7761-3
Type :
conf
DOI :
10.1109/ISCAS.2003.1206035
Filename :
1206035
Link To Document :
بازگشت