• DocumentCode
    742196
  • Title

    Detailed 3D Representations for Object Recognition and Modeling

  • Author

    Zia, M. Zeeshan ; Stark, Michael ; Schiele, Bernt ; Schindler, Kaspar

  • Author_Institution
    Photogrammetry & Remote Sensing Lab., ETH Zurich, Zurich, Switzerland
  • Volume
    35
  • Issue
    11
  • fYear
    2013
  • Firstpage
    2608
  • Lastpage
    2623
  • Abstract
    Geometric 3D reasoning at the level of objects has received renewed attention recently in the context of visual scene understanding. The level of geometric detail, however, is typically limited to qualitative representations or coarse boxes. This is linked to the fact that today´s object class detectors are tuned toward robust 2D matching rather than accurate 3D geometry, encouraged by bounding-box-based benchmarks such as Pascal VOC. In this paper, we revisit ideas from the early days of computer vision, namely, detailed, 3D geometric object class representations for recognition. These representations can recover geometrically far more accurate object hypotheses than just bounding boxes, including continuous estimates of object pose and 3D wireframes with relative 3D positions of object parts. In combination with robust techniques for shape description and inference, we outperform state-of-the-art results in monocular 3D pose estimation. In a series of experiments, we analyze our approach in detail and demonstrate novel applications enabled by such an object class representation, such as fine-grained categorization of cars and bicycles, according to their 3D geometry, and ultrawide baseline matching.
  • Keywords
    computational geometry; computer vision; image matching; image reconstruction; image representation; inference mechanisms; object recognition; pose estimation; solid modelling; 3D geometric object class representations; 3D positions; 3D wireframes; bounding-box-based benchmarks; computer vision; fine-grained categorization; geometric 3D reasoning; geometric detail; image 3D reconstruction; inference; monocular 3D pose estimation; object class detectors; object modeling; object pose estimation; object recognition; robust 2D matching; shape description; visual scene understanding; Computational modeling; Design automation; Detectors; Geometry; Shape; Solid modeling; Three-dimensional displays; 3D representation; recognition; scene understanding; single image 3D reconstruction; ultrawide baseline matching; Algorithms; Artificial Intelligence; Computer Simulation; Image Interpretation, Computer-Assisted; Imaging, Three-Dimensional; Models, Theoretical; Pattern Recognition, Automated; Photography;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2013.87
  • Filename
    6516504