Title :
Summarizing video using non-negative similarity matrix factorization
Author :
Cooper, Matthew ; Foote, Jonathan
Author_Institution :
FX Palo Alto Lab., CA, USA
Abstract :
We present a novel approach to automatically extracting summary excerpts from audio video and video. Our approach is to maximize the average similarity between the excerpt and the source. We first calculate a similarity matrix by comparing each pair of time samples using a quantitative similarity measure. To determine the segment with highest average similarity, we maximize the summation of the self-similarity matrix over the support of the segment. To select multiple excerpts while avoiding redundancy, we compute the non-negative matrix factorization (NMF) of the similarity matrix into its essential structural components. We then build a summary comprised of excerpts from the main components, selecting the excerpts for maximum average similarity within each component. Variations integrating segmentation and other information are also discussed, and experimental results are presented.
Keywords :
audio signal processing; feature extraction; image segmentation; matrix decomposition; video signal processing; audio excerpts; maximum average similarity; multiple excerpts; nonnegative similarity matrix factorization; quantitative similarity measures; segment support; segmentation integration; self-similarity matrix; video excerpts; video summary; Bandwidth; Buildings; Concatenated codes; Frequency; Information retrieval; Laboratories; Layout; Matrix decomposition; Production facilities; Time measurement;
Conference_Titel :
Multimedia Signal Processing, 2002 IEEE Workshop on
Print_ISBN :
0-7803-7713-3
DOI :
10.1109/MMSP.2002.1203239