مرکز منطقه ای اطلاع رساني علوم و فناوري - Partial Linear Regression for Audio-Driven Talking Head Application

DocumentCode :

2235330

Title :

Partial Linear Regression for Audio-Driven Talking Head Application

Author :

Hsieh, Chao-Kuei ; Chen, Yung-Chang

Author_Institution :

Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu

fYear :

2005

fDate :

6-6 July 2005

Firstpage :

281

Lastpage :

284

Abstract :

Virtual avatars in many applications are constructed manually or by a single speech-driven model, which needs a lot of training data and long training time. It is an essential problem to build up a user-dependent model more efficiently. In this paper, a new adaptation method, called the partial linear regression (PLR) is proposed and adopted in an audio-driven talking head application. This method allows users to adapt the partial parameters from the available adaptive data while keeping the others unchanged. In our experiments, the PLR algorithm can retrench the hours of time spent on retraining a new user-dependent model, and adjust the user-independent model to a more personalized one. The animated results with adapted models were 36% closer to the user-dependent model than using the pre-trained user-independent model

Keywords :

avatars; computer animation; regression analysis; PLR algorithm; adaptive data; audio-driven talking head application; computer animation; partial linear regression; single speech-driven model; user-dependent model; virtual avatar; Application software; Avatars; Chaos; Hidden Markov models; Linear regression; Loudspeakers; Maximum likelihood linear regression; Neural networks; Speech synthesis; Training data;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on

Conference_Location :

Amsterdam

Print_ISBN :

0-7803-9331-7

Type :

conf

DOI :

10.1109/ICME.2005.1521415

Filename :

1521415

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2235330