• DocumentCode
    3402502
  • Title

    Single image depth estimation from predicted semantic labels

  • Author

    Liu, Beyang ; Gould, Stephen ; Koller, Daphne

  • Author_Institution
    Dept. of Comput. Sci., Stanford Univ., Stanford, CA, USA
  • fYear
    2010
  • fDate
    13-18 June 2010
  • Firstpage
    1253
  • Lastpage
    1260
  • Abstract
    We consider the problem of estimating the depth of each pixel in a scene from a single monocular image. Unlike traditional approaches, which attempt to map from appearance features to depth directly, we first perform a semantic segmentation of the scene and use the semantic labels to guide the 3D reconstruction. This approach provides several advantages: By knowing the semantic class of a pixel or region, depth and geometry constraints can be easily enforced (e.g., “sky” is far away and “ground” is horizontal). In addition, depth can be more readily predicted by measuring the difference in appearance with respect to a given semantic class. For example, a tree will have more uniform appearance in the distance than it does close up. Finally, the incorporation of semantic features allows us to achieve state-of-the-art results with a significantly simpler model than previous works.
  • Keywords
    image reconstruction; image segmentation; 3D reconstruction; geometry constraint; semantic label; semantic segmentation; single image depth estimation; single monocular image; Application software; Computer science; Computer vision; Geometry; Image reconstruction; Image segmentation; Layout; Pixel; Predictive models; Roads;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4244-6984-0
  • Type

    conf

  • DOI
    10.1109/CVPR.2010.5539823
  • Filename
    5539823