DocumentCode
3207437
Title
Creating a Speech Enabled Avatar from a Single Photograph
Author
Bitouk, Dmitri ; Nayar, Shree K.
Author_Institution
Columbia Univ., New York
fYear
2008
fDate
8-12 March 2008
Firstpage
107
Lastpage
110
Abstract
This paper presents a complete framework for creating a speech-enabled avatar from a single image of a person. Our approach uses a generic facial motion model which represents deformations of a prototype face during speech. We have developed an HMM-based facial animation algorithm which takes into account both lexical stress and coarticulation. This algorithm produces realistic animations of the prototype facial surface from either text or speech. The generic facial motion model can be transformed to a novel face geometry using a set of corresponding points between the prototype face surface and the novel face. Given a face photograph, a small number of manually selected features in the photograph are used to deform the prototype face surface. The deformed surface is then used to animate the face in the photograph. We show several examples of avatars that are driven by text and speech inputs.
Keywords
avatars; computational geometry; computer animation; face recognition; hidden Markov models; speech-based user interfaces; HMM-based facial animation algorithm; coarticulation; face geometry; generic facial motion model; hidden Markov model; lexical stress; single photograph; speech enabled avatar; Avatars; Computer graphics; Deformable models; Facial animation; Hidden Markov models; Prototypes; Software prototyping; Solid modeling; Speech synthesis; Stress; H.5.2 [Information Interfaces and Presentation]: Multimedia Information Systems - Animations; I.3.7 [Computer Graphics]: Three-Dimensional Graphics and Realism - Animation
fLanguage
English
Publisher
IEEE
Conference_Titel
Virtual Reality Conference, 2008. VR '08. IEEE
Conference_Location
Reno, NV
Print_ISBN
978-1-4244-1971-5
Electronic_ISBN
978-1-4244-1972-2
Type
conf
DOI
10.1109/VR.2008.4480758
Filename
4480758