DocumentCode
3209979
Title
Names and faces in the news
Author
Berg, Tamara L. ; Berg, Alexander C. ; Edwards, Jaety ; Maire, Michael ; White, Ryan ; Teh, Yee-Whye ; Learned-Miller, Erik ; Forsyth, D.A.
Author_Institution
Div. of Comput. Sci., California Univ., Berkeley, CA, USA
Volume
2
fYear
2004
fDate
27 June-2 July 2004
Abstract
We show quite good face clustering is possible for a dataset of inaccurately and ambiguously labelled face images. Our dataset is 44,773 face images, obtained by applying a face finder to approximately half a million captioned news images. This dataset is more realistic than usual face recognition datasets, because it contains faces captured "in the wild" in a variety of configurations with respect to the camera, taking a variety of expressions, and under illumination of widely varying color. Each face image is associated with a set of names, automatically extracted from the associated caption. Many, but not all such sets contain the correct name. We cluster face images in appropriate discriminant coordinates. We use a clustering procedure to break ambiguities in labelling and identify incorrectly labelled faces. A merging procedure then identifies variants of names that refer to the same individual. The resulting representation can be used to label faces in news images or to organize news pictures by individuals present. An alternative view of our procedure is as a process that cleans up noisy supervised data. We demonstrate how to use entropy measures to evaluate such procedures.
Keywords
entropy; face recognition; noise; pattern clustering; ambiguously labelled face images; captioned news images; discriminant coordinates; entropy measures; face clustering; face finder; face recognition datasets; noisy supervised data;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition, 2004. CVPR 2004. Proceedings of the 2004 IEEE Computer Society Conference on
ISSN
1063-6919
Print_ISBN
0-7695-2158-4
Type
conf
DOI
10.1109/CVPR.2004.1315253
Filename
1315253
Link To Document