DocumentCode
311368
Title
Audio-visual interaction in multimedia communication
Author
Chen, Tsuhan ; Rao, Ram R.
Author_Institution
Res., AT&T Bell Labs., Holmdel, NJ, USA
Volume
1
fYear
1997
fDate
21-24 Apr 1997
Firstpage
179
Abstract
To many people, the word “multimedia” simply means the combination of various forms of information: text, speech, music, images, graphics and video. What is often overlooked is the interaction among these forms. In this paper, we present our results in exploiting the audio-visual interaction that is very significant in multimedia communication. The applications include lip synchronization, joint audio-video coding, and person verification. We present the enabling technologies, including audio-to-visual mapping and facial image analysis, for these applications. Our results show that the joint processing of audio and video provides advantages that are not available when audio and video are studied separately.
Keywords
multilayer perceptrons; multimedia communication; probability; speech processing; speech recognition; teleconferencing; video coding; audio-to-visual mapping; audio-visual interaction; enabling technologies; facial image analysis; joint audio-video coding; lip synchronization; person verification; Background noise; Graphics; Humans; Image analysis; Image converters; Mouth; Multimedia communication; Shape; Speech; Videoconference
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location
Munich
ISSN
1520-6149
Print_ISBN
0-8186-7919-0
Type
conf
DOI
10.1109/ICASSP.1997.599592
Filename
599592