• DocumentCode
    301259
  • Title

    Speech-assisted lip synchronization in audio-visual communications

  • Author

    Tsuhan Chen ; Graf, H.P. ; Haskell, B. ; Petajan, E. ; Yao Wang ; Chen, Homer ; Chou, Wu

  • Author_Institution
    AT&T Bell Labs., Holmdel, NJ, USA
  • Volume
    2
  • fYear
    1995
  • fDate
    23-26 Oct 1995
  • Firstpage
    579
  • Abstract
    We utilize speech information to improve the quality of audio-visual communications such as video telephony and videoconferencing. We show that the marriage of speech analysis and image processing can solve problems related to lip synchronization. We present a technique called speech-assisted frame-rate conversion, and apply it to coding of talking head video. Demonstration sequences are presented. Extensions and other applications are outlined
  • Keywords
    audio-visual systems; image processing; speech processing; synchronisation; teleconferencing; video coding; videotelephony; audio-visual communications; image processing; speech analysis; speech information; speech-assisted frame-rate conversion; speech-assisted lip synchronization; talking head video coding; video telephony; videoconferencing; Decoding; Head; Humans; Image coding; Image converters; Image motion analysis; Image processing; Image sequence analysis; Mouth; Speech analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing, 1995. Proceedings., International Conference on
  • Conference_Location
    Washington, DC
  • Print_ISBN
    0-8186-7310-9
  • Type

    conf

  • DOI
    10.1109/ICIP.1995.537545
  • Filename
    537545