مرکز منطقه ای اطلاع رساني علوم و فناوري - Model-assisted coding of video teleconferencing sequences at low bit rates

DocumentCode :

293091

Title :

Model-assisted coding of video teleconferencing sequences at low bit rates

Author :

Eleftheriadis, Alexandros ; Jacquin, Arnaud

Author_Institution :

Dept. of Electr. Eng., Columbia Univ., New York, NY, USA

Volume :

fYear :

1994

fDate :

30 May-2 Jun 1994

Firstpage :

177

Abstract :

We present a novel and practical way to integrate techniques from computer vision to low bit rate coding systems for video teleconferencing applications. Our focus is to locate and track the faces of persons in typical head-and-shoulders video sequences, and to exploit the face location information in a “classical” video coding/decoding system. The motivation is to enable the system to selectively encode various image areas and to produce psychologically pleasing coded images where faces are sharper. We refer to this approach as model-assisted coding. We propose a totally automatic, low-complexity algorithm, which robustly performs face detection and tracking. A priori assumptions regarding sequence content are minimal and the algorithm operates accurately even in cases of occlusion by moving objects. Face location information is exploited by a low bit rate 3D subband-based video coder which uses a model-assisted dynamic bit allocation with object-selective quantization. By transferring a small fraction of the total available bit rate from the non-facial to the facial area, the coder produces images with better-rendered facial features. The improvement was found to be perceptually significant on video sequences coded at 96 kbps for an input luminance signal in CIF format. The technique is applicable to any video coding scheme that allows for fine-grain quantizer selection (e.g. MPEG, H.261), and can maintain full decoder compatibility

Keywords :

face recognition; image sequences; quantisation (signal); teleconferencing; video coding; 3D subband-based video coder; CIF format; dynamic bit allocation; face detection; face location information; fine-grain quantizer selection; full decoder compatibility; head-and-shoulders video sequences; input luminance signal; low bit rate coding; model-assisted coding; object-selective quantization; occlusion; sequence content; video teleconferencing sequences; Application software; Bit rate; Computer vision; Decoding; Face detection; Focusing; Psychology; Teleconferencing; Video coding; Video sequences;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Circuits and Systems, 1994. ISCAS '94., 1994 IEEE International Symposium on

Conference_Location :

London

Print_ISBN :

0-7803-1915-X

Type :

conf

DOI :

10.1109/ISCAS.1994.409135

Filename :

409135

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=293091