• DocumentCode
    1341721
  • Title

    Efficient summarization of stereoscopic video sequences

  • Author

    Doulamis, Nikolaos D. ; Doulamis, Anastasios D. ; Avrithis, Yannis S. ; Ntalianis, Klimis S. ; Kollias, Stefanos D.

  • Author_Institution
    Electr. & Comput. Eng. Dept., Nat. Tech. Univ. of Athens, Greece
  • Volume
    10
  • Issue
    4
  • fYear
    2000
  • fDate
    6/1/2000 12:00:00 AM
  • Firstpage
    501
  • Lastpage
    517
  • Abstract
    An efficient technique for summarization of stereoscopic video sequences is presented, which extracts a small but meaningful set of video frames using a content-based sampling algorithm. The proposed video-content representation provides the capability of browsing digital stereoscopic video sequences and performing more efficient content-based queries and indexing. Each stereoscopic video sequence is first partitioned into shots by applying a shot-cut detection algorithm so that frames (or stereo pairs) of similar visual characteristics are gathered together. Each shot is then analyzed using stereo-imaging techniques, and the disparity field, occluded areas, and depth map are estimated. A multiresolution implementation of the recursive shortest spanning tree (RSST) algorithm is applied for color and depth segmentation, while fusion of color and depth segments is employed for reliable video object extraction. In particular, color segments are projected onto depth segments so that video objects on the same depth plane are retained, while at the same time accurate object boundaries are extracted. Feature vectors are then constructed using multidimensional fuzzy classification of segment features including size, location, color, and depth. Shot selection is accomplished by clustering similar shots based on the generalized Lloyd-Max algorithm, while for a given shot, key frames are extracted using an optimization method for locating frames of minimally correlated feature vectors. For efficient implementation of the latter method, a genetic algorithm is used. Experimental results are presented, which indicate the reliable performance of the proposed scheme on real-life stereoscopic video sequences
  • Keywords
    content-based retrieval; edge detection; feature extraction; fuzzy logic; genetic algorithms; image classification; image colour analysis; image representation; image resolution; image sampling; image segmentation; image sequences; indexing; pattern clustering; stereo image processing; trees (mathematics); video signal processing; RSST algorithm; browsing; clustering; color segmentation; color segments; content-based queries; content-based sampling algorithm; depth map; depth segmentation; depth segments; digital stereoscopic video sequences; disparity field; feature vectors; fusion; generalized Lloyd-Max algorithm; genetic algorithm; indexing; location; multidimensional fuzzy classification; multiresolution implementation; object boundaries; occluded areas; optimization method; real-life stereoscopic video sequences; recursive shortest spanning tree algorithm; reliable video object extraction; shot selection; shot-cut detection algorithm; size; stereo pairs; stereo-imaging; stereoscopic video sequences; summarization; video frames; video-content representation; Clustering algorithms; Detection algorithms; Feature extraction; Genetic algorithms; Indexing; Multidimensional systems; Optimization methods; Partitioning algorithms; Sampling methods; Video sequences;
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems for Video Technology, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1051-8215
  • Type

    jour

  • DOI
    10.1109/76.844996
  • Filename
    844996