مرکز منطقه ای اطلاع رساني علوم و فناوري - Detailed 3D Representations for Object Recognition and Modeling

DocumentCode :

742196

Title :

Detailed 3D Representations for Object Recognition and Modeling

Author :

Zia, M. Zeeshan ; Stark, Michael ; Schiele, Bernt ; Schindler, Kaspar

Author_Institution :

Photogrammetry & Remote Sensing Lab., ETH Zurich, Zurich, Switzerland

Volume :

Issue :

fYear :

2013

Firstpage :

2608

Lastpage :

2623

Abstract :

Geometric 3D reasoning at the level of objects has received renewed attention recently in the context of visual scene understanding. The level of geometric detail, however, is typically limited to qualitative representations or coarse boxes. This is linked to the fact that today´s object class detectors are tuned toward robust 2D matching rather than accurate 3D geometry, encouraged by bounding-box-based benchmarks such as Pascal VOC. In this paper, we revisit ideas from the early days of computer vision, namely, detailed, 3D geometric object class representations for recognition. These representations can recover geometrically far more accurate object hypotheses than just bounding boxes, including continuous estimates of object pose and 3D wireframes with relative 3D positions of object parts. In combination with robust techniques for shape description and inference, we outperform state-of-the-art results in monocular 3D pose estimation. In a series of experiments, we analyze our approach in detail and demonstrate novel applications enabled by such an object class representation, such as fine-grained categorization of cars and bicycles, according to their 3D geometry, and ultrawide baseline matching.

Keywords :

computational geometry; computer vision; image matching; image reconstruction; image representation; inference mechanisms; object recognition; pose estimation; solid modelling; 3D geometric object class representations; 3D positions; 3D wireframes; bounding-box-based benchmarks; computer vision; fine-grained categorization; geometric 3D reasoning; geometric detail; image 3D reconstruction; inference; monocular 3D pose estimation; object class detectors; object modeling; object pose estimation; object recognition; robust 2D matching; shape description; visual scene understanding; Computational modeling; Design automation; Detectors; Geometry; Shape; Solid modeling; Three-dimensional displays; 3D representation; recognition; scene understanding; single image 3D reconstruction; ultrawide baseline matching; Algorithms; Artificial Intelligence; Computer Simulation; Image Interpretation, Computer-Assisted; Imaging, Three-Dimensional; Models, Theoretical; Pattern Recognition, Automated; Photography;

fLanguage :

English

Journal_Title :

Pattern Analysis and Machine Intelligence, IEEE Transactions on

Publisher :

ieee

ISSN :

0162-8828

Type :

jour

DOI :

10.1109/TPAMI.2013.87

Filename :

6516504

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=742196