DocumentCode :
2291491
Title :
A method and browser for cross-referenced video summaries
Author :
Aner, A. ; Tang, Lijun ; Kender, John R.
Author_Institution :
Dept. of Comput. Sci., Columbia Univ., New York, NY, USA
Volume :
2
fYear :
2002
fDate :
2002
Firstpage :
237
Abstract :
We present an automatic tool for compact representation and cross-referencing of long video sequences, which is based on a novel visual abstraction of semantic content. Our highly compact hierarchical representation results from the non-temporal clustering of scene segments into a new conceptual form grounded in the recognition of real-world backgrounds. We represent shots and scenes using mosaics and employ a novel method for the comparison of scenes based on these representative mosaics. We then cluster scenes together into a higher level of abstraction-the physical setting. We demonstrate our work using situation comedies (sitcoms), where each half-hour episode is well structured by rules governing background use. Consequently, browsing, indexing and comparison across videos by physical setting is very fast. Further, we show that physical settings lead to a higher-level contextual identification of the main plots in each video. We demonstrate these contributions with a browsing tool whose top-level single page displays the settings of several episodes. This page expands to display windows for each episode, and each episode menu summary is further expanded into scenes and shots, all by mouse-clicking on appropriate plots and settings according to user interests.
Keywords :
image representation; image segmentation; image sequences; indexing; information retrieval; pattern clustering; video signal processing; automatic tool; background use; browsing; compact representation; comparison; contextual identification; cross-referenced video summaries; cross-referencing; episode menu summary; half-hour episode; highly compact hierarchical representation; indexing; long video sequences; mosaics; nontemporal clustering; physical setting; plots; real-world backgrounds; recognition; scene segments; semantic content; sitcoms; situation comedies; top-level single page; visual abstraction; Broadcasting; Cameras; Computer science; Displays; Indexing; Layout; Motion pictures; Multimedia communication; Video compression; Video sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2002. ICME '02. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7803-7304-9
Type :
conf
DOI :
10.1109/ICME.2002.1035560
Filename :
1035560
Link To Document :
بازگشت