DocumentCode
3168215
Title
Baum-Welch hidden Markov model inversion for reliable audio-to-visual conversion
Author
Choi, KyouugHo ; Hwang, Jenq-Neng
Author_Institution
Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
fYear
1999
fDate
1999
Firstpage
175
Lastpage
180
Abstract
In this paper, a novel audio-to-visual conversion method is presented. Many multimedia applications, such as videophones, videoconferencing, man-machine interface, language dubbing, character animation in virtual reality, etc., require techniques for synchronizing audio and video in a synthesized talking head sequence. For these applications, it is necessary to reliably estimate accurate mouth (visual) movements from the corresponding speech (audio) data. The hidden Markov model inversion (HMMI) technique introduced for robust speech recognition is extended in this paper into the audio-visual feature space. Based on the Baum-Welch HMMI method, reliable visual parameters are extracted given speech data only. Our preliminary simulation results show that the estimated visual parameters from the proposed method match the true visual parameters smoothly as well as accurately. The proposed estimation technique can be combined with video coding and graphics techniques for other multimedia applications
Keywords
audio signal processing; hidden Markov models; image sequences; multimedia systems; speech recognition; Baum-Welch hidden Markov model inversion; accurate mouth movements; audio/video synchronisation; character animation; graphics techniques; language dubbing; man-machine interface; multimedia applications; reliable audio-to-visual conversion; reliable visual parameter extraction; speech data; synthesised talking head sequence; video coding; videoconferencing; videophone; virtual reality; Animation; Data mining; Hidden Markov models; Mouth; Robustness; Speech recognition; Speech synthesis; Teleconferencing; User interfaces; Virtual reality;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Signal Processing, 1999 IEEE 3rd Workshop on
Conference_Location
Copenhagen
Print_ISBN
0-7803-5610-1
Type
conf
DOI
10.1109/MMSP.1999.793816
Filename
793816
Link To Document