Title :
Speech recognition for image animation and coding
Author :
Chou, Wu ; Chen, Homer H.
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
Abstract :
We discuss some issues related to acoustic-assisted image coding and animation. A talker-independent scheme for acoustic-assisted image coding and animation is studied, and a perceptually based sliding-window encoder is proposed. The encoder uses the high-rate (oversampled) viseme sequence from the audio domain for image-domain viseme interpolation and smoothing. The image-domain visemes in our approach are dynamically constructed from a set of basic visemes. The look-ahead and look-back moving interpolations in the proposed approach provide an effective way to compensate for the mismatch between auditory and visual perception.
Keywords :
acoustic signal processing; computer animation; hearing; image coding; interpolation; signal sampling; smoothing methods; speech recognition; visual perception; acoustic assisted image coding; audio domain; auditory perception; high rate viseme sequence; image animation; image coding; image domain viseme interpolation; image domain viseme smoothing; look-ahead moving interpolation; look-back moving interpolation; oversampled viseme sequence; sliding window encoder; speech recognition; talker independent animation; talker independent image coding; visual perception; Animation; Bit rate; Decoding; Humans; Image coding; Image sequences; Interpolation; Mouth; Shape; Smoothing methods; Speech recognition; Visual perception;
Conference_Title :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479939