Title :
Audio-to-visual conversion for multimedia communication
Author :
Rao, Ram R. ; Chen, Tsuhan ; Mersereau, Russell M.
Author_Institution :
AT&T Bell Labs., Holmdel, NJ, USA
fDate :
2/1/1998
Abstract :
Although humans rely primarily on hearing to process speech, they can also extract a great deal of information with their eyes through lipreading. This skill becomes extremely important when the acoustic signal is degraded by noise. It would therefore be beneficial to find methods to reinforce acoustic speech with a synthesized visual signal for high-noise environments. This paper addresses the interaction between acoustic speech and visible speech. Algorithms for converting audible speech into visible speech are examined, and applications that can use this conversion process are presented. Our results demonstrate that it is possible to animate a natural-looking talking head using acoustic speech as an input.
Keywords :
acoustic signal processing; audio-visual systems; computer animation; correlation methods; multimedia communication; speech processing; video coding; acoustic signal; acoustic speech; algorithms; audible speech; audio-to-visual conversion; audio-visual communications; audio-visual correlation; high noise environments; lipreading; synthesized visual signal; talking head animation; visible speech; Acoustic noise; Auditory system; Data mining; Degradation; Eyes; Humans; Multimedia communication; Speech processing; Speech synthesis; Working environment noise
Journal_Title :
Industrial Electronics, IEEE Transactions on