DocumentCode
301259
Title
Speech-assisted lip synchronization in audio-visual communications
Author
Tsuhan Chen ; Graf, H.P. ; Haskell, B. ; Petajan, E. ; Yao Wang ; Chen, Homer ; Chou, Wu
Author_Institution
AT&T Bell Labs., Holmdel, NJ, USA
Volume
2
fYear
1995
fDate
23-26 Oct 1995
Firstpage
579
Abstract
We utilize speech information to improve the quality of audio-visual communications such as video telephony and videoconferencing. We show that the marriage of speech analysis and image processing can solve problems related to lip synchronization. We present a technique called speech-assisted frame-rate conversion, and apply it to coding of talking head video. Demonstration sequences are presented. Extensions and other applications are outlined
Keywords
audio-visual systems; image processing; speech processing; synchronisation; teleconferencing; video coding; videotelephony; audio-visual communications; image processing; speech analysis; speech information; speech-assisted frame-rate conversion; speech-assisted lip synchronization; talking head video coding; video telephony; videoconferencing; Decoding; Head; Humans; Image coding; Image converters; Image motion analysis; Image processing; Image sequence analysis; Mouth; Speech analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Image Processing, 1995. Proceedings., International Conference on
Conference_Location
Washington, DC
Print_ISBN
0-8186-7310-9
Type
conf
DOI
10.1109/ICIP.1995.537545
Filename
537545
Link To Document