DocumentCode :
2467743
Title :
An audio-visual distance for audio-visual speech vector quantization
Author :
Girin, L. ; Foucher, E. ; Feng, G.
Author_Institution :
Inst. de la Commun. Parlee, ENSERG, Grenoble, France
fYear :
1998
fDate :
7-9 Dec 1998
Firstpage :
523
Lastpage :
528
Abstract :
Speech is both an acoustic and a visual signal, and there exists some complementarity and redundancy between the two modalities. In the speech coding domain, it is of great interest to use this redundancy to improve speech coder performance. In this paper, we consider some audio and video joint coding process based on an audio-visual vector quantization. The method is shown to exploit quite well the audio-visual redundancy as it can reduce the bit rate while decreasing the quantization error. A notion of audio-visual distance has to be introduced and adapted to the different nature of the data. It is defined from an existing audio distance and a new visual distance, which is particularly focussed
Keywords :
audio coding; speech coding; vector quantisation; video coding; audio and video joint coding process; audio-visual distance; audio-visual redundancy; audio-visual speech vector quantization; bit rate; speech coding; Bit rate; Decoding; Filtering; Filters; Pulse measurements; Redundancy; Speech coding; Speech processing; Vector quantization; Vocoders;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia Signal Processing, 1998 IEEE Second Workshop on
Conference_Location :
Redondo Beach, CA
Print_ISBN :
0-7803-4919-9
Type :
conf
DOI :
10.1109/MMSP.1998.739034
Filename :
739034
Link To Document :
بازگشت