Title :
Talking faces indexing in TV-content
Author :
Bendris, Meriem ; Charlet, Delphine ; Chollet, Gérard
Author_Institution :
R&D, Orange Labs., France Telecom, Issy-Les-Moulineaux, France
Abstract :
Our objective is to index talking faces in a TV-Context: build a description of TV-content, in terms of talking people, without any pre-defined dictionary of identities. In TV-content, because of multi-face shots and non-speaking face shots, it is difficult to determine which face is speaking. In this work, a method is proposed which clusters people independently by the audio and by the visual information and combines these clusterings of people (audio and visual) in order to detect sequences of talking faces. The audio indexing system is based on agglomerative clustering with the Bayesian Information Criterion. The visual indexing system is based on costume detection and clustering of color histograms. The combination of both indexes is based on searching for the best match between both clusterings, to obtain a correspondence between the automatic audio labels and the automatic video labels. The talking faces are then determined by the intersection of the segments of the associated audio and video labels. Results of experiments on a TV-Show database show that a high correct detection rate can be achieved by the proposed method.
Keywords :
Bayes methods; audio signal processing; image colour analysis; indexing; pattern clustering; video signal processing; Bayesian information criterion; TV content; agglomerative clustering; audio indexing system; automatic audio labels; automatic video labels; color histograms clustering; costume detection; nonspeaking face shot; talking faces indexing; talking faces sequences detection; visual indexing system; Acoustic measurements; Bayesian methods; Clustering methods; Dictionaries; Face detection; Indexing; Research and development; Speech analysis; Telecommunications; Video on demand; Talking faces indexing; audio-visual indexing; speaker clustering; video clustering;
Conference_Titel :
Content-Based Multimedia Indexing (CBMI), 2010 International Workshop on
Conference_Location :
Grenoble
Print_ISBN :
978-1-4244-8028-9
Electronic_ISBN :
1949-3983
DOI :
10.1109/CBMI.2010.5529907