• DocumentCode
    1341736
  • Title

    Audio-to-visual conversion for multimedia communication

  • Author

    Rao, Ram R. ; Chen, Tsuhan ; Mersereau, Russell M.

  • Author_Institution
    AT&T Bell Labs., Holmdel, NJ, USA
  • Volume
    45
  • Issue
    1
  • fYear
    1998
  • fDate
    2/1/1998 12:00:00 AM
  • Firstpage
    15
  • Lastpage
    22
  • Abstract
    Although humans rely primarily on hearing to process speech, they can also extract a great deal of information with their eyes through lipreading. This skill becomes extremely important when the acoustic signal is degraded by noise. It would, therefore, be beneficial to find methods to reinforce acoustic speech with a synthesized visual signal for high noise environments. This paper addresses the interaction between acoustic speech and visible speech. Algorithms for converting audible speech into visible speech are examined, and applications which can utilize this conversion process are presented. Our results demonstrate that it is possible to animate a natural-looking talking head using acoustic speech as an input.
  • Keywords
    acoustic signal processing; audio-visual systems; computer animation; correlation methods; multimedia communication; speech processing; video coding; acoustic signal; acoustic speech; algorithms; audible speech; audio-to-visual conversion; audio-visual communications; audio-visual correlation; high noise environments; lipreading; multimedia communication; synthesized visual signal; talking head animation; video coding; visible speech; Acoustic noise; Auditory system; Data mining; Degradation; Eyes; Humans; Multimedia communication; Speech processing; Speech synthesis; Working environment noise;
  • fLanguage
    English
  • Journal_Title
    Industrial Electronics, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0278-0046
  • Type

    jour

  • DOI
    10.1109/41.661300
  • Filename
    661300