DocumentCode
1313395
Title
Towards Scalable Summarization of Consumer Videos Via Sparse Dictionary Selection
Author
Cong, Yang ; Yuan, Junsong ; Luo, Jiebo
Author_Institution
Dept. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
Volume
14
Issue
1
fYear
2012
Firstpage
66
Lastpage
75
Abstract
The rapid growth of consumer videos requires an effective and efficient content summarization method to provide a user-friendly way to manage and browse the huge amount of video data. Compared with most previous methods that focus on sports and news videos, the summarization of personal videos is more challenging because of its unconstrained content and the lack of any pre-imposed video structures. We formulate video summarization as a novel dictionary selection problem using sparsity consistency, where a dictionary of key frames is selected such that the original video can be best reconstructed from this representative dictionary. An efficient global optimization algorithm is introduced to solve the dictionary selection model with the convergence rates as O(1/K2) (where K is the iteration counter), in contrast to traditional sub-gradient descent methods of O(1/√K). Our method provides a scalable solution for both key frame extraction and video skim generation, because one can select an arbitrary number of key frames to represent the original videos. Experiments on a human labeled benchmark dataset and comparisons to the state-of-the-art methods demonstrate the advantages of our algorithm.
Keywords
computational complexity; convergence of numerical methods; dictionaries; feature extraction; gradient methods; video signal processing; consumer video summarization; content summarization method; convergence rates; key frame extraction; sparse dictionary selection problem; sparsity consistency; sub-gradient descent methods; video data browsing; video data management; video skim generation; Cameras; Databases; Dictionaries; Electronic mail; Feature extraction; Image reconstruction; Videos; Group sparse; Lasso; key frame; scene analysis; video analysis; video skim; video summarization;
fLanguage
English
Journal_Title
Multimedia, IEEE Transactions on
Publisher
ieee
ISSN
1520-9210
Type
jour
DOI
10.1109/TMM.2011.2166951
Filename
6008652
Link To Document