مرکز منطقه ای اطلاع رساني علوم و فناوري - An audio-visual distance for audio-visual speech vector quantization

DocumentCode :

2467743

Title :

An audio-visual distance for audio-visual speech vector quantization

Author :

Girin, L. ; Foucher, E. ; Feng, G.

Author_Institution :

Inst. de la Commun. Parlee, ENSERG, Grenoble, France

fYear :

1998

fDate :

7-9 Dec 1998

Firstpage :

523

Lastpage :

528

Abstract :

Speech is both an acoustic and a visual signal, and there exists some complementarity and redundancy between the two modalities. In the speech coding domain, it is of great interest to use this redundancy to improve speech coder performance. In this paper, we consider some audio and video joint coding process based on an audio-visual vector quantization. The method is shown to exploit quite well the audio-visual redundancy as it can reduce the bit rate while decreasing the quantization error. A notion of audio-visual distance has to be introduced and adapted to the different nature of the data. It is defined from an existing audio distance and a new visual distance, which is particularly focussed

Keywords :

audio coding; speech coding; vector quantisation; video coding; audio and video joint coding process; audio-visual distance; audio-visual redundancy; audio-visual speech vector quantization; bit rate; speech coding; Bit rate; Decoding; Filtering; Filters; Pulse measurements; Redundancy; Speech coding; Speech processing; Vector quantization; Vocoders;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Multimedia Signal Processing, 1998 IEEE Second Workshop on

Conference_Location :

Redondo Beach, CA

Print_ISBN :

0-7803-4919-9

Type :

conf

DOI :

10.1109/MMSP.1998.739034

Filename :

739034

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2467743