Title :
Are Cars Just 3D Boxes? Jointly Estimating the 3D Shape of Multiple Objects
Author :
Zia, M. Zeeshan ; Stark, Michael ; Schindler, Kaspar
Author_Institution :
Photogrammetry & Remote Sensing, ETH Zurich, Zurich, Switzerland
Abstract :
Current systems for scene understanding typically represent objects as 2D or 3D bounding boxes. While these representations have proven robust in a variety of applications, they provide only coarse approximations to the true 2D and 3D extent of objects. As a result, object-object interactions, such as occlusions or ground-plane contact, can be represented only superficially. In this paper, we approach the problem of scene understanding from the perspective of 3D shape modeling, and design a 3D scene representation that reasons jointly about the 3D shape of multiple objects. This representation allows to express 3D geometry and occlusion on the fine detail level of individual vertices of 3D wireframe models, and makes it possible to treat dependencies between objects, such as occlusion reasoning, in a deterministic way. In our experiments, we demonstrate the benefit of jointly estimating the 3D shape of multiple objects in a scene over working with coarse boxes, on the recently proposed KITTI dataset of realistic street scenes.
Keywords :
image representation; object detection; shape recognition; 3D geometry; 3D scene representation; 3D shape modeling; 3D wireframe models; ground-plane contact; multiple objects 3D shape estimation; object-object interactions; occlusion reasoning; realistic street scenes; Cognition; Detectors; Estimation; Geometry; Shape; Solid modeling; Three-dimensional displays; 3D object recognition; Scene understanding;
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
Conference_Location :
Columbus, OH
DOI :
10.1109/CVPR.2014.470