Are Cars Just 3D Boxes? Jointly Estimating the 3D Shape of Multiple Objects

Author

Zia, M. Zeeshan ; Stark, Michael ; Schindler, Kaspar

Author_Institution

Photogrammetry & Remote Sensing, ETH Zurich, Zurich, Switzerland

fYear

2014

fDate

23-28 June 2014

Firstpage

3678

Lastpage

3685

Abstract

Current systems for scene understanding typically represent objects as 2D or 3D bounding boxes. While these representations have proven robust in a variety of applications, they provide only coarse approximations to the true 2D and 3D extent of objects. As a result, object-object interactions, such as occlusions or ground-plane contact, can be represented only superficially. In this paper, we approach the problem of scene understanding from the perspective of 3D shape modeling, and design a 3D scene representation that reasons jointly about the 3D shape of multiple objects. This representation allows to express 3D geometry and occlusion on the fine detail level of individual vertices of 3D wireframe models, and makes it possible to treat dependencies between objects, such as occlusion reasoning, in a deterministic way. In our experiments, we demonstrate the benefit of jointly estimating the 3D shape of multiple objects in a scene over working with coarse boxes, on the recently proposed KITTI dataset of realistic street scenes.

Keywords

image representation; object detection; shape recognition; 3D geometry; 3D scene representation; 3D shape modeling; 3D wireframe models; ground-plane contact; multiple objects 3D shape estimation; object-object interactions; occlusion reasoning; realistic street scenes; Cognition; Detectors; Estimation; Geometry; Shape; Solid modeling; Three-dimensional displays; 3D object recognition; Scene understanding;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on

Conference_Location

Columbus, OH

Type

conf

DOI

10.1109/CVPR.2014.470

Filename

6909865