مرکز منطقه ای اطلاع رساني علوم و فناوري - Constrained optimization for a speech driven talking head

DocumentCode :

1560627

Title :

Constrained optimization for a speech driven talking head

Author :

Choi, Kyoung-Ho ; Lee, Jong-Hoon

Author_Institution :

Electron. & Telecommun. Res. Inst., Comput. & Software Res. Inst., Daejeon, South Korea

Volume :

fYear :

2003

Abstract :

In this paper, a novel algorithm for audio-to-visual conversion based on constrained optimization is presented. Based on facial muscle analysis, the dynamics of mouth movements are modeled and constraints are obtained from them. The obtained constraints are used to estimate visual parameters from speech in a framework of HMM-based visual parameter estimation. The proposed constrained optimization approach finds visual parameters that satisfy given constraints and maximize the auxiliary functions that are used to train the audio-visual HMMs. This approach enables the algorithm to produce reliable visual parameters even in noisy environments. Experimental results demonstrate that the proposed audio-to-visual conversion method is able to follow true visual parameters robustly in various noisy environments.

Keywords :

audio-visual systems; computer animation; hidden Markov models; multimedia systems; optimisation; speech processing; speech recognition; speech-based user interfaces; synchronisation; HMM-based visual parameter estimation; animated face; audio-to-visual conversion; audio-visual HMM training; constrained optimization; facial muscle analysis; mouth movement dynamics; multimedia applications; noisy environments; speech-driven talking head; synchronization; talking head systems; virtual face; Constraint optimization; Degradation; Facial animation; Facial muscles; Hidden Markov models; Mouth; Parameter estimation; Robustness; Speech; Working environment noise;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Circuits and Systems, 2003. ISCAS '03. Proceedings of the 2003 International Symposium on

Print_ISBN :

0-7803-7761-3

Type :

conf

DOI :

10.1109/ISCAS.2003.1206035

Filename :

1206035

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1560627