DocumentCode
3707543
Title
Learning depth from a single image using visual-depth words
Author
Sunok Kim;Sunghwan Choi;Kwanghoon Sohn
Author_Institution
School of Electrical and Electronic Engineering, Yonsei University, Seoul, Korea
fYear
2015
Firstpage
1895
Lastpage
1899
Abstract
Estimating depth from a single monocular image is a fundamental problem in computer vision. Traditional methods for such estimation usually require complicated and sometimes labor-intensive processing. In this paper, we propose a new perspective for this problem and suggest a new gradient-domain learning framework which is much simpler and more efficient. Inspired by the observation that there is substantial co-occurrence of image edges and depth discontinuities in natural scenes, we learn the relationship between local appearance features and corresponding depth gradients by making use of the K-means clustering algorithm within the image feature space. We then encode each cluster centroid with its associated depth gradients, which defines visual-depth words that model the image-depth relationship very well. This enables one to estimate the scene depth for an arbitrary image by simply selecting proper depth gradients from a compact dictionary of visual-depth words, followed by a Poisson surface reconstruction. Experimental results demonstrate that the proposed gradient-domain approach outperforms state-of-the-art methods both qualitatively and quantitatively and is generic over (unseen) scene categories which are not used for training.
Keywords
"Image edge detection","Training","Image reconstruction","Color","Dictionaries","Image resolution","Surface reconstruction"
Publisher
ieee
Conference_Titel
Image Processing (ICIP), 2015 IEEE International Conference on
Type
conf
DOI
10.1109/ICIP.2015.7351130
Filename
7351130
Link To Document