DocumentCode
1341736
Title
Audio-to-visual conversion for multimedia communication
Author
Rao, Ram R. ; Chen, Tsuhan ; Mersereau, Russell M.
Author_Institution
AT&T Bell Labs., Holmdel, NJ, USA
Volume
45
Issue
1
fYear
1998
fDate
2/1/1998 12:00:00 AM
Firstpage
15
Lastpage
22
Abstract
Although humans rely primarily on hearing to process speech, they can also extract a great deal of information with their eyes through lipreading. This skill becomes extremely important when the acoustic signal is degraded by noise. It would, therefore, be beneficial to find methods to reinforce acoustic speech with a synthesized visual signal for high noise environments. This paper addresses the interaction between acoustic speech and visible speech. Algorithms for converting audible speech into visible speech are examined, and applications which can utilize this conversion process are presented. Our results demonstrate that it is possible to animate a natural-looking talking head using acoustic speech as an input
Keywords
acoustic signal processing; audio-visual systems; computer animation; correlation methods; multimedia communication; speech processing; video coding; acoustic signal; acoustic speech; algorithms; audible speech; audio-to-visual conversion; audio-visual communications; audio-visual correlation; high noise environments; lipreading; multimedia communication; synthesized visual signal; talking head animation; video coding; visible speech; Acoustic noise; Auditory system; Data mining; Degradation; Eyes; Humans; Multimedia communication; Speech processing; Speech synthesis; Working environment noise;
fLanguage
English
Journal_Title
Industrial Electronics, IEEE Transactions on
Publisher
ieee
ISSN
0278-0046
Type
jour
DOI
10.1109/41.661300
Filename
661300
Link To Document