Title :
Content-Based 3-D Mosaics for Representing Videos of Dynamic Urban Scenes
Author :
Tang, Hao ; Zhu, Zhigang
Author_Institution :
Dept. of Comput. Sci., City Univ. of New York, New York, NY, USA
Abstract :
We propose a content-based 3-D mosaic (CB3M) representation for long video sequences of dynamic 3-D urban scenes captured by a camera on a mobile platform. In the first phase, a set of parallel-perspective (pushbroom) mosaics with varying viewing directions is generated to capture both the 3-D and dynamic aspects of the scene under the camera coverage. In the second phase, a segmentation-based stereo matching algorithm is applied to extract parametric representations of the color, structure, and motion of the dynamic and/or 3-D objects in urban scenes, where many planar surfaces exist. Multiple pairs of stereo mosaics are used to facilitate reliable stereo matching, occlusion handling, accurate 3-D reconstruction, and robust moving target detection. CB3M is a highly compressed visual representation of a dynamic 3-D scene whose object contents carry both 3-D and motion information. Experimental results are given for various real video sequences of large-scale 3-D scenes.
Keywords :
image colour analysis; image matching; image motion analysis; image reconstruction; image representation; image segmentation; image sequences; realistic images; stereo image processing; video cameras; video coding; 3D objects; CB3M representation; accurate 3D reconstruction; camera coverage; color; compressed visual representation; content-based 3D mosaics; dynamic 3D scene; dynamic urban scenes; large-scale 3D scenes; mobile platform; motion information; object contents; occlusion handling; parallel-perspective mosaics; parametric representations; planar surfaces; pushbroom mosaics; real video sequences; reliable stereo matching; representing videos; robust moving target detection; segmentation-based stereo matching algorithm; stereo mosaics; Cameras; Dynamics; Geometry; Stereo image processing; Three dimensional displays; Videos; 3-D scene representation; content-based video coding; image-based modeling; multi-image registration;
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
DOI :
10.1109/TCSVT.2011.2178729