DocumentCode
3400345
Title
Acoustic driven viseme identification for face animation
Author
Zhong, Jialin ; Chou, Wu ; Petajan, Eric
Author_Institution
Lucent Technol., AT&T Bell Labs., Murray Hill, NJ, USA
fYear
1997
fDate
23-25 Jun 1997
Firstpage
7
Lastpage
12
Abstract
Unlike other image templates, visemes have identities in two different media. In audio domain, they are often related to basic linguistic units such as phonemes. In image domain, they are defined by the images of human articulators, such as mouth shapes, chin movements, etc. In this paper, an approach of extracting visemes from both image and acoustic domains is presented. In image domain, the mouth shapes, represented by feature points on inner lip contours, are extracted through face tracking and mouth image analysis. In acoustic domain, viseme segments are obtained automatically by aligning phoneme strings to audio signals through a Viterbi alignment process
Keywords
Viterbi detection; computer animation; feature extraction; image matching; speech processing; speech synthesis; Viterbi alignment process; acoustic driven viseme identification; audio signals; basic linguistic units; chin movements; face animation; human articulators; inner lip contours; mouth shapes; phoneme strings; phonemes; Face; Facial animation; Hidden Markov models; Humans; Image segmentation; Image sequences; Mouth; Shape; Speech synthesis; Viterbi algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Signal Processing, 1997., IEEE First Workshop on
Conference_Location
Princeton, NJ
Print_ISBN
0-7803-3780-8
Type
conf
DOI
10.1109/MMSP.1997.602605
Filename
602605
Link To Document