مرکز منطقه ای اطلاع رساني علوم و فناوري - Large scale learning and recognition of faces in web videos

DocumentCode :

3135396

Title :

Large scale learning and recognition of faces in web videos

Author :

Zhao, John ; Yagnik, Jay ; Adam, Hartwig ; Bau, David

fYear :

2008

fDate :

17-19 Sept. 2008

Firstpage :

Lastpage :

Abstract :

The phenomenal growth of video on the Web and the increasing sparseness of meta information associated with it forces us to look for signals from the video content for search/information retrieval and browsing based corpus exploration. A large chunk of users´ searching/browsing patterns are centered around people present in the video. Doing it at scale in videos remains hard due to a) the absence of labeled data for such a large set of people and b) the large variation of pose/illumination/expression/age/occlusion/quality etc in the target corpus. We propose a system that can learn and recognize faces by combining signals from large scale weakly labeled text, image, and video corpora. First, consistency learning is proposed to create face models for popular persons. We use the text-image co-occurrence on the web as a weak signal of relevance and learn the set of consistent face models from this very large and noisy training set. Second, efficient and accurate face detection and face tracking is applied. Last, the key faces in each face track is select by clustering to get compact and robust representation. The face tracks are further clustered to get more representative key faces and remove duplicate key faces. For each cluster of face tracks, a combination of majority voting and probabilistic voting is done with the automatically learned models. The effectiveness of our framework is demonstrated by results on image and video corpora, in which we can achieve 92.68% in 37 million images and 80% top-5-precision in 1500 hours videos.

Keywords :

Internet; face recognition; information retrieval; object detection; probability; target tracking; video signal processing; Web videos; face detection; face models; face recognition; face tracking; information retrieval; large scale learning; probabilistic voting; text-image co-occurrence; video content; Content based retrieval; Face detection; Face recognition; Image recognition; Information retrieval; Large-scale systems; Lighting; Text recognition; Videos; Voting;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Automatic Face & Gesture Recognition, 2008. FG '08. 8th IEEE International Conference on

Conference_Location :

Amsterdam

Print_ISBN :

978-1-4244-2153-4

Electronic_ISBN :

978-1-4244-2154-1

Type :

conf

DOI :

10.1109/AFGR.2008.4813381

Filename :

4813381

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3135396