DocumentCode
2291491
Title
A method and browser for cross-referenced video summaries
Author
Aner, A. ; Tang, Lijun ; Kender, John R.
Author_Institution
Dept. of Comput. Sci., Columbia Univ., New York, NY, USA
Volume
2
fYear
2002
fDate
2002
Firstpage
237
Abstract
We present an automatic tool for compact representation and cross-referencing of long video sequences, which is based on a novel visual abstraction of semantic content. Our highly compact hierarchical representation results from the non-temporal clustering of scene segments into a new conceptual form grounded in the recognition of real-world backgrounds. We represent shots and scenes using mosaics and employ a novel method for the comparison of scenes based on these representative mosaics. We then cluster scenes together into a higher level of abstraction-the physical setting. We demonstrate our work using situation comedies (sitcoms), where each half-hour episode is well structured by rules governing background use. Consequently, browsing, indexing and comparison across videos by physical setting is very fast. Further, we show that physical settings lead to a higher-level contextual identification of the main plots in each video. We demonstrate these contributions with a browsing tool whose top-level single page displays the settings of several episodes. This page expands to display windows for each episode, and each episode menu summary is further expanded into scenes and shots, all by mouse-clicking on appropriate plots and settings according to user interests.
Keywords
image representation; image segmentation; image sequences; indexing; information retrieval; pattern clustering; video signal processing; automatic tool; background use; browsing; compact representation; comparison; contextual identification; cross-referenced video summaries; cross-referencing; episode menu summary; half-hour episode; highly compact hierarchical representation; indexing; long video sequences; mosaics; nontemporal clustering; physical setting; plots; real-world backgrounds; recognition; scene segments; semantic content; sitcoms; situation comedies; top-level single page; visual abstraction; Broadcasting; Cameras; Computer science; Displays; Indexing; Layout; Motion pictures; Multimedia communication; Video compression; Video sequences;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo, 2002. ICME '02. Proceedings. 2002 IEEE International Conference on
Print_ISBN
0-7803-7304-9
Type
conf
DOI
10.1109/ICME.2002.1035560
Filename
1035560
Link To Document