• DocumentCode
    311368
  • Title
    Audio-visual interaction in multimedia communication
  • Author
    Chen, Tsuhan; Rao, Ram R.
  • Author_Institution
    Res., AT&T Bell Labs., Holmdel, NJ, USA
  • Volume
    1
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    179
  • Abstract
    To many people, the word “multimedia” simply means the combination of various forms of information: text, speech, music, images, graphics and video. What is often overlooked is the interaction among these forms. In this paper, we present our results in exploiting the audio-visual interaction that is very significant in multimedia communication. The applications include lip synchronization, joint audio-video coding, and person verification. We present the enabling technologies, including audio-to-visual mapping and facial image analysis, for these applications. Our results show that the joint processing of audio and video provides advantages that are not available when audio and video are studied separately.
  • Keywords
    multilayer perceptrons; multimedia communication; probability; speech processing; speech recognition; teleconferencing; video coding; audio-to-visual mapping; audio-visual interaction; enabling technologies; facial image analysis; joint audio-video coding; lip synchronization; person verification; Background noise; Graphics; Humans; Image analysis; Image converters; Mouth; Shape; Speech; Videoconference
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type
    conf
  • DOI
    10.1109/ICASSP.1997.599592
  • Filename
    599592