DocumentCode :
177576
Title :
Multiple-view constrained clustering for unsupervised face identification in TV-broadcast
Author :
Bendris, Meriem ; Favre, Benoit ; Charlet, D. ; Damnati, Geraldine ; Auguste, Remi
Author_Institution :
Aix Marseille Univ., Marseille, France
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
494
Lastpage :
498
Abstract :
Our goal is to automatically identify faces in TV broadcast without a pre-defined dictionary of identities. Most methods are based on identity detection (from OCR and ASR) and require a propagation strategy based on visual clustering. In TV content, people appear with many variations making the clustering difficult. In this case, speaker clustering can be a reliable link for face clustering. We propose in this paper to build automatically an incomplete speaker-face mapping based on local evidence of OCR and Lip activity links. Then, we propose schemes of speaker constraints propagation to the face constrained-clustering problem. Experiments performed on the REPERE corpus show an improvement of face identification by propagating names to face clusters (+3.7% F-measure compared to the baseline).
Keywords :
face recognition; speaker recognition; television broadcasting; ASR; OCR; REPERE corpus; TV broadcast; TV content; face constrained-clustering problem; identity detection; incomplete speaker-face mapping; lip activity links; multiple view constrained clustering; speaker clustering; speaker constraints propagation; unsupervised face identification; visual clustering; Clustering algorithms; Face; Optical character recognition software; Speech; TV; TV broadcasting; Videos;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6853645
Filename :
6853645
Link To Document :
بازگشت