DocumentCode :
1749859
Title :
Major cast detection in video using both audio and visual information
Author :
Liu, Zhu ; Wang, Yao
Author_Institution :
AT&T Labs - Res., Middletown, NJ, USA
Volume :
3
fYear :
2001
fDate :
2001
Firstpage :
1413
Abstract :
Major casts, for example, the anchor persons or reporters in news broadcast programs and the principal characters in movies, play an important role in video, and their occurrences provide good indices for organizing and presenting video content. This paper describes a new approach for automatically generating the list of major casts in a video sequence based on multiple modalities, specifically, both speaker and face information. A list of major casts is created and ordered by the accumulative temporal and spatial presence of the corresponding casts. Preliminary simulation results show that the detected major casts are meaningful and the proposed approach is promising.
Keywords :
audio signal processing; content-based retrieval; face recognition; feature extraction; image retrieval; video databases; video signal processing; accumulative temporal presence; audio visual information; content description; face information; indices; major cast detection; spatial presence; speaker information; video content; video sequence; Broadcasting; Data mining; Detection algorithms; Face detection; Layout; Motion pictures; Multimedia communication; Music information retrieval; Speech analysis; Video sequences;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
ISSN :
1520-6149
Print_ISBN :
0-7803-7041-4
Type :
conf
DOI :
10.1109/ICASSP.2001.941194
Filename :
941194