• DocumentCode
    3484885
  • Title

    Face detection directly from h.264 compressed video with convolutional neural network

  • Author

    Zhuang, Shin-Shan ; Lai, Shang-Hong

  • Author_Institution
    Dept. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
  • fYear
    2009
  • fDate
    7-10 Nov. 2009
  • Firstpage
    2485
  • Lastpage
    2488
  • Abstract
    Human faces provide a useful cue in indexing video content. In this paper, we propose a novel face detection algorithm based on a convolutional neural network architecture that can rapidly detect human face regions in video sequences encoded by H.264/AVC. By detecting faces directly in the compressed domain, we use the discrete cosine transform (DCT) coefficients in H.264 intra coding as the feature vector for face detection, thus it is not necessary to carry out additional DCT transform during the encoding or decoding process. With the face detector inside the video encoding process, we can adjust the coding parameters adaptively and allocate more resources to the macroblocks corresponding to the face regions. Some experimental results of applying the face detector on the H.264 intra coded images are given to demonstrate the performance of the proposed algorithm.
  • Keywords
    data compression; decoding; discrete cosine transforms; image sequences; indexing; neural net architecture; object detection; resource allocation; transform coding; video coding; DCT transform; H.264 compressed video; H.264 intracoded images; H.264/AVC; convolutional neural network architecture; decoding process; discrete cosine transform coefficient; human face region detection algorithm; resource allocation; video content indexing; video encoding process; video sequences; Convolutional codes; Detectors; Discrete cosine transforms; Encoding; Face detection; Humans; Indexing; Neural networks; Video compression; Video sequences; DCT; Face detection; neural network; video codec;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing (ICIP), 2009 16th IEEE International Conference on
  • Conference_Location
    Cairo
  • ISSN
    1522-4880
  • Print_ISBN
    978-1-4244-5653-6
  • Electronic_ISBN
    1522-4880
  • Type

    conf

  • DOI
    10.1109/ICIP.2009.5413922
  • Filename
    5413922