DocumentCode :
3484885
Title :
Face detection directly from h.264 compressed video with convolutional neural network
Author :
Zhuang, Shin-Shan ; Lai, Shang-Hong
Author_Institution :
Dept. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
fYear :
2009
fDate :
7-10 Nov. 2009
Firstpage :
2485
Lastpage :
2488
Abstract :
Human faces provide a useful cue in indexing video content. In this paper, we propose a novel face detection algorithm based on a convolutional neural network architecture that can rapidly detect human face regions in video sequences encoded by H.264/AVC. By detecting faces directly in the compressed domain, we use the discrete cosine transform (DCT) coefficients in H.264 intra coding as the feature vector for face detection, thus it is not necessary to carry out additional DCT transform during the encoding or decoding process. With the face detector inside the video encoding process, we can adjust the coding parameters adaptively and allocate more resources to the macroblocks corresponding to the face regions. Some experimental results of applying the face detector on the H.264 intra coded images are given to demonstrate the performance of the proposed algorithm.
Keywords :
data compression; decoding; discrete cosine transforms; image sequences; indexing; neural net architecture; object detection; resource allocation; transform coding; video coding; DCT transform; H.264 compressed video; H.264 intracoded images; H.264/AVC; convolutional neural network architecture; decoding process; discrete cosine transform coefficient; human face region detection algorithm; resource allocation; video content indexing; video encoding process; video sequences; Convolutional codes; Detectors; Discrete cosine transforms; Encoding; Face detection; Humans; Indexing; Neural networks; Video compression; Video sequences; DCT; Face detection; neural network; video codec;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Processing (ICIP), 2009 16th IEEE International Conference on
Conference_Location :
Cairo
ISSN :
1522-4880
Print_ISBN :
978-1-4244-5653-6
Electronic_ISBN :
1522-4880
Type :
conf
DOI :
10.1109/ICIP.2009.5413922
Filename :
5413922
Link To Document :
بازگشت