• DocumentCode
    105431
  • Title

    VideoPuzzle: Descriptive One-Shot Video Composition

  • Author

    Qiang Chen ; Meng Wang ; Zhongyang Huang ; Yang Hua ; Zheng Song ; Shuicheng Yan

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Nat. Univ. of Singapore, Singapore, Singapore
  • Volume
    15
  • Issue
    3
  • fYear
    2013
  • fDate
    Apr-13
  • Firstpage
    521
  • Lastpage
    534
  • Abstract
    A large amount of short, single-shot videos are created by personal camcorder every day, such as the small video clips in family albums, and thus a solution for presenting and managing these video clips is highly desired. From the perspective of professionalism and artistry, long-take/shot video, also termed one-shot video, is able to present events, persons or scenic spots in an informative manner. This paper presents a novel video composition system “Video Puzzle” which generates aesthetically enhanced long-shot videos from short video clips. Our task here is to automatically composite several related single shots into a virtual long-take video with spatial and temporal consistency. We propose a novel framework to compose descriptive long-take video with content-consistent shots retrieved from a video pool. For each video, frame-by-frame search is performed over the entire pool to find start-end content correspondences through a coarse-to-fine partial matching process. The content correspondence here is general and can refer to the matched regions or objects, such as human body and face. The content consistency of these correspondences enables us to design several shot transition schemes to seamlessly stitch one shot to another in a spatially and temporally consistent manner. The entire long-take video thus comprises several single shots with consistent contents and ίuent transitions. Meanwhile, with the generated matching graph of videos, the proposed system can also provide an efficient video browsing mode. Experiments are conducted on multiple video albums and the results demonstrate the effectiveness and the usefulness of the proposed scheme.
  • Keywords
    graph theory; search problems; video signal processing; coarse-to-fine partial matching process; content-consistent shots; descriptive one-shot video composition; events; frame-by-frame search; generated matching graph; human body; human face; multiple video albums; persons; scenic spots; short video clips; shot transition schemes; single-shot videos; spatial consistency; start-end content correspondences; temporal consistency; video browsing mode; video pool; video puzzle; virtual long-take video; Cameras; Educational institutions; Laboratories; Materials; Video equipment; Video sequences; Visualization; Image retrieval; one-shot video; video authoring; video transition;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2012.2236306
  • Filename
    6392964