DocumentCode
1349941
Title
Audio-visual integration in multimodal communication
Author
Chen, Tsuhan ; Rao, Ram R.
Author_Institution
AT&T Bell Labs., Holmdel, NJ, USA
Volume
86
Issue
5
fYear
1998
fDate
5/1/1998 12:00:00 AM
Firstpage
837
Lastpage
852
Abstract
We review recent research that examines audio-visual integration in multimodal communication. The topics include bimodality in human speech, human and automated lip reading, facial animation, lip synchronization, joint audio-video coding, and bimodal speaker verification. We also study the enabling technologies for these research topics, including automatic facial-feature tracking and audio-to-visual mapping. Recent progress in audio-visual research shows that joint processing of audio and video provides advantages that are not available when the audio and video are processed independently
Keywords
audio coding; audio-visual systems; computer animation; multimedia communication; speaker recognition; synchronisation; video coding; voice communication; audio-to-visual mapping; audio-video coding; audio-visual integration; automatic facial-feature tracking; bimodal speaker verification; bimodality; facial animation; human speech; lip reading; lip synchronization; multimedia communication; multimodal communication; research; speech communication; video signal processing; Computer graphics; Facial animation; Humans; Image coding; Signal synthesis; Speech analysis; Speech synthesis; Video compression; Video signal processing; Videoconference;
fLanguage
English
Journal_Title
Proceedings of the IEEE
Publisher
ieee
ISSN
0018-9219
Type
jour
DOI
10.1109/5.664274
Filename
664274
Link To Document