Title :
Partial Linear Regression for Audio-Driven Talking Head Application
Author :
Hsieh, Chao-Kuei ; Chen, Yung-Chang
Author_Institution :
Dept. of Electr. Eng., Nat. Tsing Hua Univ., Hsinchu
Abstract :
Virtual avatars in many applications are constructed manually or by a single speech-driven model, which requires a large amount of training data and a long training time. Building a user-dependent model more efficiently is therefore an essential problem. In this paper, a new adaptation method called partial linear regression (PLR) is proposed and applied to an audio-driven talking head application. The method allows users to adapt only part of the parameters from the available adaptive data while keeping the others unchanged. In our experiments, the PLR algorithm saved the hours of time otherwise spent retraining a new user-dependent model and adjusted the user-independent model into a more personalized one. The animated results with the adapted models were 36% closer to the user-dependent model than those obtained with the pre-trained user-independent model.
Keywords :
avatars; computer animation; regression analysis; PLR algorithm; adaptive data; audio-driven talking head application; partial linear regression; single speech-driven model; user-dependent model; virtual avatar; Application software; Avatars; Chaos; Hidden Markov models; Linear regression; Loudspeakers; Maximum likelihood linear regression; Neural networks; Speech synthesis; Training data
Conference_Title :
Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on
Conference_Location :
Amsterdam
Print_ISBN :
0-7803-9331-7
DOI :
10.1109/ICME.2005.1521415