• DocumentCode
    3207437
  • Title

    Creating a Speech Enabled Avatar from a Single Photograph

  • Author

    Bitouk, Dmitri ; Nayar, Shree K.

  • Author_Institution
    Columbia Univ., New York
  • fYear
    2008
  • fDate
    8-12 March 2008
  • Firstpage
    107
  • Lastpage
    110
  • Abstract
    This paper presents a complete framework for creating a speech-enabled avatar from a single image of a person. Our approach uses a generic facial motion model which represents deformations of a prototype face during speech. We have developed an HMM-based facial animation algorithm which takes into account both lexical stress and coarticulation. This algorithm produces realistic animations of the prototype facial surface from either text or speech. The generic facial motion model can be transformed to a novel face geometry using a set of corresponding points between the prototype face surface and the novel face. Given a face photograph, a small number of manually selected features in the photograph are used to deform the prototype face surface. The deformed surface is then used to animate the face in the photograph. We show several examples of avatars that are driven by text and speech inputs.
  • Keywords
    avatars; computational geometry; computer animation; face recognition; hidden Markov models; speech-based user interfaces; HMM-based facial animation algorithm; coarticulation; face geometry; generic facial motion model; hidden Markov model; lexical stress; single photograph; speech enabled avatar; Avatars; Computer graphics; Deformable models; Facial animation; Hidden Markov models; Prototypes; Software prototyping; Solid modeling; Speech synthesis; Stress; H.5.2 [Information Interfaces and Presentation]: Multimedia Information Systems¿Animations; I.3.7 [Computer Graphics]: Three-Dimensional Graphics and Realism¿Animation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Virtual Reality Conference, 2008. VR '08. IEEE
  • Conference_Location
    Reno, NE
  • Print_ISBN
    978-1-4244-1971-5
  • Electronic_ISBN
    978-1-4244-1972-2
  • Type

    conf

  • DOI
    10.1109/VR.2008.4480758
  • Filename
    4480758