Designing deep networks for surface normal estimation

Author

Xiaolong Wang;David F. Fouhey;Abhinav Gupta

Author_Institution

Robotics Institute, Carnegie Mellon University, USA

fYear

2015

fDate

6/1/2015 12:00:00 AM

Firstpage

539

Lastpage

547

Abstract

In the past few years, convolutional neural nets (CNN) have shown incredible promise for learning visual representations. In this paper, we use CNNs for the task of predicting surface normals from a single image. But what is the right architecture? We propose to build upon the decades of hard work in 3D scene understanding to design a new CNN architecture for the task of surface normal estimation. We show that incorporating several constraints (man-made, Manhattan world) and meaningful intermediate representations (room layout, edge labels) in the architecture leads to state of the art performance on surface normal estimation. We also show that our network is quite robust and show state of the art results on other datasets as well without any fine-tuning.

Keywords

"Layout","Estimation","Image edge detection","Convolutional codes","Three-dimensional displays","Neurons","Surface treatment"

Publisher

ieee

Conference_Titel

Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on

Electronic_ISBN

1063-6919

Type

conf

DOI

10.1109/CVPR.2015.7298652

Filename

7298652

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3672116