Title :
Region-of-interest based H.264 encoder for videophone with a hardware macroblock level face detector
Author :
Zhang, Tianruo ; Liu, Chen ; Wang, Minghui ; Goto, Satoshi
Author_Institution :
Grad. Sch. of Inf., Production & Syst., Waseda Univ., Kitakyushu, Japan
Abstract :
Region-of-interest (ROI) can be applied in H.264 video encoder to enhance subjective quality and reduce computation complexity. For the aiming application of low cost hardware real-time encoder in videophone with faces as ROI, this paper proposes a face detection algorithm to detect each macroblock (MB) as one part of a face or not. This face detection algorithm has a unique estimation-and-verification process and can be combined with a H.264 encoder by MB level pipeline architecture. 97.91% MBs in faces can be detected. VLSI architecture of proposed face detection algorithm is designed and an area of 4.3 k gates is achieved. Power consumption is only 1.45 mW at 100 MHz. A ROI based H.264 encoder with dynamic parameters is proposed to enhance subjective quality and reduce the rate-distortion-optimization (RDO) complexity. The PSNR in ROI increases for 4.8 dB under similar bit rate. Encoding time is reduced to 54.4% in videophone-like sequences.
Keywords :
codecs; computational complexity; face recognition; rate distortion theory; video coding; videotelephony; H.264 encoder; MB level pipeline architecture; ROI; computation complexity; estimation-and-verification process; face detection algorithm; hardware macroblock level face detector; rate-distortion-optimization complexity; region-of-interest; subjective quality; video encoder; videophone; Algorithm design and analysis; Bit rate; Costs; Detectors; Energy consumption; Face detection; Hardware; PSNR; Pipelines; Very large scale integration;
Conference_Titel :
Multimedia Signal Processing, 2009. MMSP '09. IEEE International Workshop on
Conference_Location :
Rio De Janeiro
Print_ISBN :
978-1-4244-4463-2
Electronic_ISBN :
978-1-4244-4464-9
DOI :
10.1109/MMSP.2009.5293338