DocumentCode
3315454
Title
Sample-based synthesis of talking heads
Author
Graf, Hans Peter ; Cosatto, Eric
Author_Institution
AT&T Labs., Middletown, NJ, USA
fYear
2001
fDate
2001
Firstpage
3
Lastpage
7
Abstract
Synthesizing photo-realistic talking heads is a challenging problem, and so far all attempts using conventional computer graphics have produced heads with a distinctly synthetic look. To look credible, a head must show a picture-perfect appearance, natural head movements, and good lip-sound synchronization. We use sample-based graphics to achieve more photo-realistic appearances than is possible with the traditional approach of 3D modeling and texture mapping. In sample-based graphics, parts of faces are first cut from recorded images and stored in a database. New sequences are then synthesized by integrating such parts into whole faces. With sufficient recorded data, this approach produces by far the most natural-looking speech articulation. We integrate 3D modeling with the sample-based technique in order to enhance its flexibility. This allows, for example, showing the head over a much wider range of orientations.
Keywords
computer animation; image sequences; solid modelling; synchronisation; 3D modeling; computer graphics; head movements; lip-sound synchronization; photo-realistic appearances; picture-perfect appearance; sample-based graphics; talking heads; Computer graphics; Computer interfaces; Customer service; Facial animation; Image databases; Lips; Magnetic heads; Shape; Speech synthesis; Videos
fLanguage
English
Publisher
IEEE
Conference_Titel
Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems, 2001. Proceedings. IEEE ICCV Workshop on
Conference_Location
Vancouver, BC
ISSN
1530-1044
Print_ISBN
0-7695-1074-4
Type
conf
DOI
10.1109/RATFG.2001.938903
Filename
938903