Title :
Realtime face analysis and synthesis using neural network
Author :
Morishima, Shigeo
Author_Institution :
ATR Media Integration Commun. Lab., Seikei Univ., Tokyo, Japan
Abstract :
Describes recent research results about how to generate an avatar´s face in a real-time process, exactly copying a real person´s face. It is very important for the synthesis of a real avatar to precisely duplicate the emotions and impressions included in the original face image and voice. A face-fitting tool from multi-angle camera images is introduced in order to make a real 3D face model with real texture and geometry that is very close to the original. When an avatar is speaking something, the voice signal is very essential for deciding the mouth shape features, so a real-time mouth shape control mechanism is proposed by conversion from speech parameters to lip shape parameters using a multi-layered neural network. For dynamic modeling of facial expressions, a muscle structure constraint is introduced to generate a facial expression naturally with just a few parameters. We also tried to obtain muscle parameters automatically in order to decide an expression from a local motion vector on the face calculated by optical flow in a video sequence. Finally, an approach is presented that enables the modeling of the emotions appearing on faces. A system with this approach helps us to analyze, synthesize and code face images at the emotional level
Keywords :
computer animation; face recognition; feedforward neural nets; geometry; image motion analysis; image sequences; image texture; muscle; real-time systems; shape control; speech processing; vectors; video signal processing; virtual reality; 3D face model; avatar face generation; dynamic modeling; emotions; face-fitting tool; facial expressions; geometry; impressions; lip shape parameters; local motion vector; mouth shape features; multi-angle camera images; multi-layered neural network; muscle parameters; muscle structure constraint; optical flow; real-time face analysis; real-time face synthesis; real-time mouth shape control mechanism; speech parameters; texture; video sequence; voice signal; Avatars; Cameras; Geometry; Mouth; Muscles; Network synthesis; Neural networks; Shape control; Solid modeling; Speech;
Conference_Titel :
Neural Networks for Signal Processing X, 2000. Proceedings of the 2000 IEEE Signal Processing Society Workshop
Conference_Location :
Sydney, NSW
Print_ISBN :
0-7803-6278-0
DOI :
10.1109/NNSP.2000.889356