Region-of-Interest Based Conversational HEVC Coding with Hierarchical Perception Model of Face

Author

Mai Xu ; Xin Deng ; Shengxi Li ; Zulin Wang

Author_Institution

Sch. of Electron. & Inf. Eng., Beihang Univ., Beijing, China

Volume

8

Issue

3

fYear

2014

fDate

Jun-14

Firstpage

475

Lastpage

489

Abstract

In this paper, we propose a region-of-interest (ROI) based HEVC coding approach for conversational videos, with a novel hierarchical perception model of face (HP model), to improve the perceived visual quality of state-of-the-art HEVC standard. In contrast to the previous ROI-based video coding approaches, this novel HP model allows the unequal importance of facial features (e.g., the eyes and mouth) within the facial region, by generating a pixel-wise weight map. Benefitting from such a perception model, the adaptive coding tree unit (CTU) partition structure is developed to alleviate the encoding complexity of HEVC, without any degradation of the visual quality in facial regions, especially in the regions of facial features. Subsequently, for the rate control in HEVC a weight-based unified rate-quantization (URQ) scheme, instead of the conventional pixel-based URQ scheme, is proposed to adaptively adjust the value of quantization parameter (QP). Such an adaptive adjustment of QPs is capable of allocating more bits to the face/facial features with respect to our HP model, and as a result, the visual quality of face, in particular facial features, can be enhanced for conversational HEVC coding. Finally, the experimental results show that the perceived visual quality of our approach is greatly improved, with even less encoding time, for conversational video coding on the HEVC platform.

Keywords

adaptive codes; video coding; visual perception; adaptive coding tree unit partition structure; conversational HEVC coding; conversational video; face model; facial feature region; hierarchical perception model; region of interest based video coding; unified rate quantization; visual quality; Encoding; Face; Facial features; Feature extraction; Video coding; Videos; Visualization; HEVC; perceptual video compression; rate distortion; teleconferencing;

fLanguage

English

Journal_Title

Selected Topics in Signal Processing, IEEE Journal of

Publisher

ieee

ISSN

1932-4553

Type

jour

DOI

10.1109/JSTSP.2014.2314864

Filename

6782435