DocumentCode
2589777
Title
Identifying individuals in video by combining ´generative´ and discriminative head models
Author
Everingham, Mark ; Zisserman, Andrew
Author_Institution
Dept. of Eng. Sci., Oxford Univ.
Volume
2
fYear
2005
fDate
17-21 Oct. 2005
Firstpage
1103
Abstract
The objective of this work is automatic detection and identification of individuals in unconstrained consumer video, given a minimal number of labelled faces as training data. Whilst much work has been done on (mainly frontal) face detection and recognition, current methods are not sufficiently robust to deal with the wide variations in pose and appearance found in such video. These include variations in scale, illumination, expression, partial occlusion, motion blur, etc. We describe two areas of innovation: the first is to capture the 3-D appearance of the entire head, rather than just the face region, so that visual features such as the hairline can be exploited. The second is to combine discriminative and ´generative´ approaches for detection and recognition. Images rendered using the head model are used to train a discriminative tree-structured classifier giving efficient detection and pose estimates over a very wide pose range with three degrees of freedom. Subsequent verification of the identity is obtained using the head model in a ´generative´ framework. We demonstrate excellent performance in detecting and identifying three characters and their poses in a TV situation comedy
Keywords
face recognition; feature extraction; object detection; pattern classification; solid modelling; 3D appearance; TV situation comedy; automatic detection; automatic identification; degrees of freedom; discriminative head model; discriminative tree-structured classifier; face detection; face recognition; generative head model; labelled faces; unconstrained consumer video; visual feature; Classification tree analysis; Face detection; Face recognition; Head; Lighting; Rendering (computer graphics); Robustness; TV; Technological innovation; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision, 2005. ICCV 2005. Tenth IEEE International Conference on
Conference_Location
Beijing
ISSN
1550-5499
Print_ISBN
0-7695-2334-X
Type
conf
DOI
10.1109/ICCV.2005.116
Filename
1544844
Link To Document