DocumentCode :
294760
Title :
Speech recognition for image animation and coding
Author :
Chou, Wu ; Chen, Homer H.
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
Volume :
4
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
2253
Abstract :
We discuss some issues related to acoustic-assisted image coding and animation. A talker-independent, acoustic-assisted image coding and animation scheme is studied. A perceptually based sliding window encoder is proposed. It utilizes the high-rate (or oversampled) viseme sequence from the audio domain for image-domain viseme interpolation and smoothing. The image-domain visemes in our approach are dynamically constructed from a set of basic visemes. The look-ahead and look-back moving interpolations in the proposed approach provide an effective way to compensate for the mismatch between auditory and visual perception.
Keywords :
acoustic signal processing; computer animation; hearing; image coding; interpolation; signal sampling; smoothing methods; speech recognition; visual perception; acoustic assisted image coding; audio domain; auditory perception; high rate viseme sequence; image animation; image coding; image domain viseme interpolation; image domain viseme smoothing; look-ahead moving interpolation; look-back moving interpolation; oversampled viseme sequence; sliding window encoder; speech recognition; talker independent animation; talker independent image coding; visual perception; Animation; Bit rate; Decoding; Humans; Image coding; Image sequences; Interpolation; Mouth; Shape; Smoothing methods; Speech recognition; Visual perception;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479939
Filename :
479939