• DocumentCode
    1112016
  • Title

    Audio visual interaction in multimedia

  • Author

    Chen, Tsuhan ; Rao, Ram

  • Author_Institution
    AT&T Bell Labs., Holmdel, NJ, USA
  • Volume
    11
  • Issue
    6
  • fYear
    1995
  • fDate
    11/1/1995 12:00:00 AM
  • Firstpage
    21
  • Lastpage
    26
  • Abstract
    Audio-visual interaction is a very important issue in personal communication applications. The research and development of multimedia communication systems should account for this interaction. In this paper, we address a number of areas related to audio-visual interaction, such as automatic lipreading, speech-driven talking heads, and lip synchronization. In particular, we will discuss a new trend in video coding research: joint audio-video coding. Given that mouth movements are very difficult to code because of their rapid, complex, and non-rigid motion (so conventional block-based motion-compensation methods fail), we will explain how having extra help from the acoustic signal can enable us to code the mouth movements more efficiently.
  • Keywords
    audio coding; multimedia communication; synchronisation; video coding; acoustic signal; audio-visual interaction; automatic lipreading; joint audio-video coding; lip synchronization; mouth movements; multimedia communication systems; personal communication applications; speech-driven talking heads; video coding research; Background noise; Humans; Lips; Mouth; Muscles; Natural languages; Pharynx; Speech; Teeth; Tongue
  • fLanguage
    English
  • Journal_Title
    Circuits and Devices Magazine, IEEE
  • Publisher
    IEEE
  • ISSN
    8755-3996
  • Type

    jour

  • DOI
    10.1109/101.476612
  • Filename
    476612