DocumentCode
2716963
Title
Hierarchical face parsing via deep learning
Author
Luo, Ping ; Wang, Xiaogang ; Tang, Xiaoou
fYear
2012
fDate
16-21 June 2012
Firstpage
2480
Lastpage
2487
Abstract
This paper investigates how to parse (segment) facial components from face images which may be partially occluded. We propose a novel face parser, which recasts segmentation of face components as a cross-modality data transformation problem, i.e., transforming an image patch to a label map. Specifically, a face is represented hierarchically by parts, components, and pixel-wise labels. With this representation, our approach first detects faces at both the part- and component-levels, and then computes the pixel-wise label maps (Fig.1). Our part-based and component-based detectors are generatively trained with the deep belief network (DBN), and are discriminatively tuned by logistic regression. The segmentators transform the detected face components to label maps, which are obtained by learning a highly nonlinear mapping with the deep autoencoder. The proposed hierarchical face parsing is not only robust to partial occlusions but also provide richer information for face analysis and face synthesis compared with face keypoint detection and face alignment. The effectiveness of our algorithm is shown through several tasks on 2, 239 images selected from three datasets (e.g., LFW [12], BioID [13] and CUFSF [29]).
Keywords
belief networks; face recognition; image segmentation; learning (artificial intelligence); object detection; regression analysis; DBN; component-based detectors; cross-modality data transformation problem; deep autoencoder; deep belief network; deep learning; face alignment; face analysis; face component segmentation; face detection; face images; face keypoint detection; face parser; face synthesis; facial component parsing; hierarchical face parsing; image patch transformation; logistic regression; nonlinear mapping learning; part-based detectors; pixel-wise label maps; Detectors; Face; Image segmentation; Logistics; Robustness; Shape; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
Conference_Location
Providence, RI
ISSN
1063-6919
Print_ISBN
978-1-4673-1226-4
Electronic_ISBN
1063-6919
Type
conf
DOI
10.1109/CVPR.2012.6247963
Filename
6247963
Link To Document