Creating a Speech Enabled Avatar from a Single Photograph

Author

Bitouk, Dmitri; Nayar, Shree K.

Author_Institution

Columbia Univ., New York

fYear

2008

fDate

8-12 March 2008

Firstpage

107

Lastpage

110

Abstract

This paper presents a complete framework for creating a speech-enabled avatar from a single image of a person. Our approach uses a generic facial motion model which represents deformations of a prototype face during speech. We have developed an HMM-based facial animation algorithm which takes into account both lexical stress and coarticulation. This algorithm produces realistic animations of the prototype facial surface from either text or speech. The generic facial motion model can be transformed to a novel face geometry using a set of corresponding points between the prototype face surface and the novel face. Given a face photograph, a small number of manually selected features in the photograph are used to deform the prototype face surface. The deformed surface is then used to animate the face in the photograph. We show several examples of avatars that are driven by text and speech inputs.

Keywords

avatars; computational geometry; computer animation; face recognition; hidden Markov models; speech-based user interfaces; HMM-based facial animation algorithm; coarticulation; face geometry; generic facial motion model; hidden Markov model; lexical stress; single photograph; speech enabled avatar; Avatars; Computer graphics; Deformable models; Facial animation; Hidden Markov models; Prototypes; Software prototyping; Solid modeling; Speech synthesis; Stress; H.5.2 [Information Interfaces and Presentation]: Multimedia Information Systems - Animations; I.3.7 [Computer Graphics]: Three-Dimensional Graphics and Realism - Animation

fLanguage

English

Publisher

IEEE

Conference_Titel

Virtual Reality Conference, 2008. VR '08. IEEE

Conference_Location

Reno, NV

Print_ISBN

978-1-4244-1971-5

Electronic_ISBN

978-1-4244-1972-2

Type

conf

DOI

10.1109/VR.2008.4480758

Filename

4480758