• DocumentCode
    1886297
  • Title
    Speech-assisted video processing: interpolation and low-bitrate coding
  • Author
    Chen, Tsuhan ; Graf, Hans Peter ; Wang, Kuansan
  • Author_Institution
    AT&T Bell Labs., Holmdel, NJ, USA
  • Volume
    2
  • fYear
    1994
  • fDate
    31 Oct-2 Nov 1994
  • Firstpage
    975
  • Abstract
    We use speech information to improve the quality of audio/visual communications such as videotelephony, videoconferencing, and multimedia. In particular, the marriage of speech processing and image processing can solve problems related to lip synchronization. The two main techniques proposed in this paper are speech-assisted interpolation and speech-assisted coding of talking head video. Audio/video sequences are presented to demonstrate our techniques.
  • Keywords
    audio-visual systems; image sequences; interpolation; speech coding; synchronisation; video coding; audio/video sequences; audio/visual communications; image processing; lip synchronization; low-bit rate coding; multimedia; speech information; speech processing; speech-assisted interpolation; speech-assisted video processing; talking head video coding; videoconferencing; videotelephony; Bit rate; Head; Image coding; Interpolation; Mouth; Speech analysis; Speech coding; Speech processing; Teleconferencing; Video coding;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signals, Systems and Computers, 1994. 1994 Conference Record of the Twenty-Eighth Asilomar Conference on
  • Conference_Location
    Pacific Grove, CA
  • ISSN
    1058-6393
  • Print_ISBN
    0-8186-6405-3
  • Type
    conf
  • DOI
    10.1109/ACSSC.1994.471605
  • Filename
    471605