Speech Animation Using Coupled Hidden Markov Models

Author

Xie, Lei ; Liu, Zhi-Qiang

Author_Institution

Sch. of Creative Media, Hong Kong City Univ.

Volume

1

fYear

0

fDate

0-0 0

Firstpage

1128

Lastpage

1131

Abstract

We present a novel speech animation approach using coupled hidden Markov models (CHMMs). Different from the conventional HMMs that use a single state chain to model the audio-visual speech with tight inter-modal synchronization, we use the CHMMs to model the asynchrony, different discriminative abilities, and temporal coupling between the audio speech and the visual speech, which are important factors for animations looking natural. Based on the audio-visual CHMMs, visual animation parameters are predicted from audio through an EM-based audio to visual conversion algorithm. Experiments on the JEWEL AV database show that compared with the conventional HMMs, the CHMMs can output visual parameters that are much closer to the actual ones. Explicit modelling of audio-visual speech is promising in speech animation

Keywords

computer animation; hidden Markov models; speech processing; audio speech animation; audio-visual speech; coupled hidden Markov model; explicit modelling; intermodal synchronization; visual animation parameter; visual speech; Application software; Augmented virtuality; Face; Facial animation; Hidden Markov models; Multimedia databases; Speech processing; Speech synthesis; Visual databases; Viterbi algorithm;

fLanguage

English

Publisher

ieee

Conference_Titel

Pattern Recognition, 2006. ICPR 2006. 18th International Conference on

Conference_Location

Hong Kong

ISSN

1051-4651

Print_ISBN

0-7695-2521-0

Type

conf

DOI

10.1109/ICPR.2006.1074

Filename

1699088