DocumentCode :
742196
Title :
Detailed 3D Representations for Object Recognition and Modeling
Author :
Zia, M. Zeeshan ; Stark, Michael ; Schiele, Bernt ; Schindler, Kaspar
Author_Institution :
Photogrammetry & Remote Sensing Lab., ETH Zurich, Zurich, Switzerland
Volume :
35
Issue :
11
fYear :
2013
Firstpage :
2608
Lastpage :
2623
Abstract :
Geometric 3D reasoning at the level of objects has received renewed attention recently in the context of visual scene understanding. The level of geometric detail, however, is typically limited to qualitative representations or coarse boxes. This is linked to the fact that today´s object class detectors are tuned toward robust 2D matching rather than accurate 3D geometry, encouraged by bounding-box-based benchmarks such as Pascal VOC. In this paper, we revisit ideas from the early days of computer vision, namely, detailed, 3D geometric object class representations for recognition. These representations can recover geometrically far more accurate object hypotheses than just bounding boxes, including continuous estimates of object pose and 3D wireframes with relative 3D positions of object parts. In combination with robust techniques for shape description and inference, we outperform state-of-the-art results in monocular 3D pose estimation. In a series of experiments, we analyze our approach in detail and demonstrate novel applications enabled by such an object class representation, such as fine-grained categorization of cars and bicycles, according to their 3D geometry, and ultrawide baseline matching.
Keywords :
computational geometry; computer vision; image matching; image reconstruction; image representation; inference mechanisms; object recognition; pose estimation; solid modelling; 3D geometric object class representations; 3D positions; 3D wireframes; bounding-box-based benchmarks; computer vision; fine-grained categorization; geometric 3D reasoning; geometric detail; image 3D reconstruction; inference; monocular 3D pose estimation; object class detectors; object modeling; object pose estimation; object recognition; robust 2D matching; shape description; visual scene understanding; Computational modeling; Design automation; Detectors; Geometry; Shape; Solid modeling; Three-dimensional displays; 3D representation; recognition; scene understanding; single image 3D reconstruction; ultrawide baseline matching; Algorithms; Artificial Intelligence; Computer Simulation; Image Interpretation, Computer-Assisted; Imaging, Three-Dimensional; Models, Theoretical; Pattern Recognition, Automated; Photography;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2013.87
Filename :
6516504
Link To Document :
بازگشت