Title :
Video skimming and characterization through the combination of image and language understanding
Author :
Smith, Michael A. ; Kanade, Takeo
Author_Institution :
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
Digital video is rapidly becoming important for education, entertainment and a host of multimedia applications. With the size of the video collections growing to thousands of hours, technology is needed to effectively browse segments in a short time without losing the content of the video. We propose a method to extract the significant audio and video information and create a skim video which represents a very short synopsis of the original. The goal of this work is to show the utility of integrating language and image understanding techniques for video skimming by extraction of significant information, such as specific objects, audio keywords and relevant video structure. The resulting skim video is much shorter; where compaction is as high as 20:1, and yet retains the essential content of the original segment. We have conducted a user-study to test the content summarization and effectiveness of the skim as a browsing tool
Keywords :
feature extraction; image segmentation; multimedia computing; natural languages; query processing; visual databases; audio information; audio keywords; browsing; content summarization; digital video; education; entertainment; image segmentation; image understanding; language understanding; multimedia applications; user study; video characterization; video collections; video skimming; Application software; Auditory displays; Cameras; Data mining; Face detection; Image analysis; Image segmentation; Layout; Object detection; Software libraries;
Conference_Titel :
Content-Based Access of Image and Video Database, 1998. Proceedings., 1998 IEEE International Workshop on
Conference_Location :
Bombay
Print_ISBN :
0-8186-8329-5
DOI :
10.1109/CAIVD.1998.646034