• DocumentCode
    2716963
  • Title
    Hierarchical face parsing via deep learning
  • Author
    Luo, Ping ; Wang, Xiaogang ; Tang, Xiaoou
  • fYear
    2012
  • fDate
    16-21 June 2012
  • Firstpage
    2480
  • Lastpage
    2487
  • Abstract
    This paper investigates how to parse (segment) facial components from face images that may be partially occluded. We propose a novel face parser, which recasts segmentation of face components as a cross-modality data transformation problem, i.e., transforming an image patch to a label map. Specifically, a face is represented hierarchically by parts, components, and pixel-wise labels. With this representation, our approach first detects faces at both the part and component levels, and then computes the pixel-wise label maps (Fig. 1). Our part-based and component-based detectors are generatively trained with the deep belief network (DBN), and are discriminatively tuned by logistic regression. The segmentators transform the detected face components to label maps, which are obtained by learning a highly nonlinear mapping with the deep autoencoder. The proposed hierarchical face parsing is not only robust to partial occlusions but also provides richer information for face analysis and face synthesis compared with face keypoint detection and face alignment. The effectiveness of our algorithm is shown through several tasks on 2,239 images selected from three datasets (LFW [12], BioID [13], and CUFSF [29]).
  • Keywords
    belief networks; face recognition; image segmentation; learning (artificial intelligence); object detection; regression analysis; DBN; component-based detectors; cross-modality data transformation problem; deep autoencoder; deep belief network; deep learning; face alignment; face analysis; face component segmentation; face detection; face images; face keypoint detection; face parser; face synthesis; facial component parsing; hierarchical face parsing; image patch transformation; logistic regression; nonlinear mapping learning; part-based detectors; pixel-wise label maps; Detectors; Face; Image segmentation; Logistics; Robustness; Shape; Training;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
  • Conference_Location
    Providence, RI
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4673-1226-4
  • Electronic_ISBN
    1063-6919
  • Type
    conf
  • DOI
    10.1109/CVPR.2012.6247963
  • Filename
    6247963