• DocumentCode
    3672479
  • Title

    Holistic 3D scene understanding from a single geo-tagged image

  • Author

    Shenlong Wang;Sanja Fidler;Raquel Urtasun

  • Author_Institution
    Department of Computer Science, University of Toronto, ON M5S, Canada
  • fYear
    2015
  • fDate
    6/1/2015 12:00:00 AM
  • Firstpage
    3964
  • Lastpage
    3972
  • Abstract
    In this paper we are interested in exploiting geographic priors to help outdoor scene understanding. Towards this goal we propose a holistic approach that reasons jointly about 3D object detection, pose estimation, semantic segmentation as well as depth reconstruction from a single image. Our approach takes advantage of large-scale crowd-sourced maps to generate dense geographic, geometric and semantic priors by rendering the 3D world. We demonstrate the effectiveness of our holistic model on the challenging KITTI dataset [13], and show significant improvements over the baselines in all metrics and tasks.
  • Keywords
    "Three-dimensional displays","Semantics","Solid modeling","Image reconstruction","Design automation","Object detection","Context"
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2015.7299022
  • Filename
    7299022