DocumentCode :
2235773
Title :
Face and gesture capturing and cloning for life-like agent
Author :
Morishima, Shigeo
Author_Institution :
Waseda Univ., Tokyo, Japan
fYear :
2004
fDate :
20-22 Sept. 2004
Firstpage :
171
Lastpage :
176
Abstract :
Face and gesture cloning is essential to make a life-like agent more believable and to give it a personality and a character of target person. To realize cloning, an accurate face capture and motion capture are inevitable to get corpus data about face expressions, speaking scenes and gestures. In this paper, our recent approach to capture the personal feature of face and gesture is presented. For the face capturing, a face location and angles are estimated from video sequence with personal 3D face model and then a synthetic face model data is imposed into frames to realize automatic stand-in system or multimodal translation system. A stand-in is a common technique for movies and TV programs in foreign languages. The current stand-in that only substitutes the voice channel results awkward matching to the mouth motion. Videophone with automatic voice translation are expected to be widely used in the near future, which may face the same problem without lip-synchronized speaking face image translation. We introduce a method to track motion of the face from the video image and then replace the face part or only mouth part with synthesized one which is synchronized with synthetic voice or spoken voice. This is one of the key technologies not only for speaking image translation and communication system, but also for an interactive entertainment system. An interactive movie system is introduced as an application of entertainment system. Capturing and copying a facial expression based on a physics base facial muscle constraint has been already presented, so in this paper, this part is not described. For a gesture capturing, commercially available motion capture products give us fairly precise movements of human body segments but do not measure enough information to define skeletal posture in its entirety. This paper describes how to obtain the complete posture of skeletal structure with the help of marker locations relative to bones that are derived from MRI data sets.
Keywords :
biomedical MRI; bone; face recognition; gesture recognition; image motion analysis; interactive systems; motion estimation; speech processing; tracking; 3D facial model; MRI data sets; automatic stand-in system; automatic voice translation; face capture; face cloning; face location; facial expression; facial muscle constraint; forearm bones; foreign languages; gesture capture; gesture cloning; human body segments; human skeletal posture; interactive entertainment system; interactive movie system; lifelike agent; motion capture; motion capture markers; motion tracking; multimodal translation system; personal 3D face model; speaking image translation; synthetic face model; video sequence; video translation system; videophone; Cloning; Facial muscles; Humans; Layout; Motion pictures; Mouth; Physics; TV; Tracking; Video sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Robot and Human Interactive Communication, 2004. ROMAN 2004. 13th IEEE International Workshop on
Print_ISBN :
0-7803-8570-5
Type :
conf
DOI :
10.1109/ROMAN.2004.1374750
Filename :
1374750
Link To Document :
بازگشت