Title :
Using HMMs in audio-to-visual conversion
Author :
Rao, R. ; Mersereau, R. ; Chen, T.
Author_Institution :
Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
Abstract :
One emerging application which exploits the correlation between audio and video is speech driven facial animation. The goal of speech driven facial animation is to synthesize realistic video sequences from acoustic speech. Much of the previous research has implemented this audio to visual conversion strategy with existing techniques such as vector quantization and neural networks. We examine how this conversion process can be accomplished with hidden Markov models
Keywords :
audio-visual systems; computer animation; hidden Markov models; speech recognition; video signal processing; HMMs; acoustic speech; audio to visual conversion strategy; conversion process; hidden Markov models; neural networks; realistic video sequences; speech driven facial animation; vector quantization; Facial animation; Hidden Markov models; Image converters; Mouth; Multilayer perceptrons; Signal processing; Speech recognition; Speech synthesis; Streaming media; Video sequences;
Conference_Titel :
Multimedia Signal Processing, 1997., IEEE First Workshop on
Conference_Location :
Princeton, NJ
Print_ISBN :
0-7803-3780-8
DOI :
10.1109/MMSP.1997.602607