DocumentCode :
432548
Title :
Detection of unique people in news programs using multimodal shot clustering
Author :
Taskiran, Cuneyt M. ; Albiol, Alberto ; Torres, Luis ; Delp, Edward J.
Author_Institution :
Sch. of Electr. & Comput. Eng., Purdue Univ., West Lafayette, IN, USA
Volume :
1
fYear :
2004
fDate :
24-27 Oct. 2004
Firstpage :
697
Abstract :
In this paper, we describe an approach that uses a combination of visual and audio features to cluster shots belonging to the same person in video programs. We use color histograms extracted from keyframes and faces, as well as cepstral coefficients derived from audio to calculate pairwise shot distances. These distances are then normalized and combined to a single confidence value which reflects our certainty that two shots contain the same person. We then use an agglomerative clustering algorithm to cluster shots based on these confidence values. We report the results of our system on a data set of approximately 8 hours of programming.
Keywords :
cepstral analysis; face recognition; feature extraction; image colour analysis; indexing; speaker recognition; video signal processing; agglomerative clustering algorithm; audio cepstral coefficients; audio features; face color histogram; face detection; feature extraction; identification confidence value; keyframe color histogram; multimodal shot clustering; news programs; normalized pairwise shot distances; program indexing; speaker detection; speech recognition; video analysis; video program unique people detection; visual features; Cameras; Clustering algorithms; DVD; Face detection; Graphics; Gunshot detection systems; Histograms; Indexing; Multimedia communication; TV broadcasting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Processing, 2004. ICIP '04. 2004 International Conference on
ISSN :
1522-4880
Print_ISBN :
0-7803-8554-3
Type :
conf
DOI :
10.1109/ICIP.2004.1418850
Filename :
1418850
Link To Document :
بازگشت