DocumentCode
2467743
Title
An audio-visual distance for audio-visual speech vector quantization
Author
Girin, L. ; Foucher, E. ; Feng, G.
Author_Institution
Inst. de la Commun. Parlee, ENSERG, Grenoble, France
fYear
1998
fDate
7-9 Dec 1998
Firstpage
523
Lastpage
528
Abstract
Speech is both an acoustic and a visual signal, and there exists some complementarity and redundancy between the two modalities. In the speech coding domain, it is of great interest to use this redundancy to improve speech coder performance. In this paper, we consider some audio and video joint coding process based on an audio-visual vector quantization. The method is shown to exploit quite well the audio-visual redundancy as it can reduce the bit rate while decreasing the quantization error. A notion of audio-visual distance has to be introduced and adapted to the different nature of the data. It is defined from an existing audio distance and a new visual distance, which is particularly focussed
Keywords
audio coding; speech coding; vector quantisation; video coding; audio and video joint coding process; audio-visual distance; audio-visual redundancy; audio-visual speech vector quantization; bit rate; speech coding; Bit rate; Decoding; Filtering; Filters; Pulse measurements; Redundancy; Speech coding; Speech processing; Vector quantization; Vocoders;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Signal Processing, 1998 IEEE Second Workshop on
Conference_Location
Redondo Beach, CA
Print_ISBN
0-7803-4919-9
Type
conf
DOI
10.1109/MMSP.1998.739034
Filename
739034
Link To Document