• DocumentCode
    3168215
  • Title

    Baum-Welch hidden Markov model inversion for reliable audio-to-visual conversion

  • Author

    Choi, KyouugHo ; Hwang, Jenq-Neng

  • Author_Institution
    Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    175
  • Lastpage
    180
  • Abstract
    In this paper, a novel audio-to-visual conversion method is presented. Many multimedia applications, such as videophones, videoconferencing, man-machine interface, language dubbing, character animation in virtual reality, etc., require techniques for synchronizing audio and video in a synthesized talking head sequence. For these applications, it is necessary to reliably estimate accurate mouth (visual) movements from the corresponding speech (audio) data. The hidden Markov model inversion (HMMI) technique introduced for robust speech recognition is extended in this paper into the audio-visual feature space. Based on the Baum-Welch HMMI method, reliable visual parameters are extracted given speech data only. Our preliminary simulation results show that the estimated visual parameters from the proposed method match the true visual parameters smoothly as well as accurately. The proposed estimation technique can be combined with video coding and graphics techniques for other multimedia applications
  • Keywords
    audio signal processing; hidden Markov models; image sequences; multimedia systems; speech recognition; Baum-Welch hidden Markov model inversion; accurate mouth movements; audio/video synchronisation; character animation; graphics techniques; language dubbing; man-machine interface; multimedia applications; reliable audio-to-visual conversion; reliable visual parameter extraction; speech data; synthesised talking head sequence; video coding; videoconferencing; videophone; virtual reality; Animation; Data mining; Hidden Markov models; Mouth; Robustness; Speech recognition; Speech synthesis; Teleconferencing; User interfaces; Virtual reality;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Signal Processing, 1999 IEEE 3rd Workshop on
  • Conference_Location
    Copenhagen
  • Print_ISBN
    0-7803-5610-1
  • Type

    conf

  • DOI
    10.1109/MMSP.1999.793816
  • Filename
    793816