DocumentCode :
254508
Title :
Joint Summarization of Large-Scale Collections of Web Images and Videos for Storyline Reconstruction
Author :
Gunhee Kim ; Sigal, Leonid ; Xing, Eric P.
fYear :
2014
fDate :
23-28 June 2014
Firstpage :
4225
Lastpage :
4232
Abstract :
In this paper, we address the problem of jointly summarizing large sets of Flickr images and YouTube videos. Starting from the intuition that the characteristics of the two media types are different yet complementary, we develop a fast and easily-parallelizable approach for creating not only high-quality video summaries but also novel structural summaries of online images as storyline graphs. The storyline graphs can illustrate various events or activities associated with the topic in a form of a branching network. The video summarization is achieved by diversity ranking on the similarity graphs between images and video frames. The reconstruction of storyline graphs is formulated as the inference of sparse time-varying directed graphs from a set of photo streams with assistance of videos. For evaluation, we collect the datasets of 20 outdoor activities, consisting of 2.7M Flickr images and 16K YouTube videos. Due to the large-scale nature of our problem, we evaluate our algorithm via crowdsourcing using Amazon Mechanical Turk. In our experiments, we demonstrate that the proposed joint summarization approach outperforms other baselines and our own methods using videos or images only.
Keywords :
Internet; directed graphs; image reconstruction; social networking (online); video signal processing; Amazon Mechanical Turk; Flickr images; YouTube videos; crowdsourcing; diversity ranking; easily-parallelizable approach; high-quality video summaries; joint summarization approach; large-scale Web image collections; large-scale Web video collections; online image structural summaries; photo streams; similarity graphs; sparse time-varying directed graphs; storyline graph reconstruction; video frames; video summarization; Image edge detection; Image reconstruction; Optimization; Streaming media; Videos; Visualization; YouTube; Storyline reconstruction; Video summarization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
Conference_Location :
Columbus, OH
Type :
conf
DOI :
10.1109/CVPR.2014.538
Filename :
6909934
Link To Document :
بازگشت