DocumentCode :
3401824
Title :
Ray Markov Random Fields for image-based 3D modeling: Model and efficient inference
Author :
Liu, Shubao ; Cooper, David B.
Author_Institution :
Lab. for Eng. Man/Machine Syst. (LEMS), Brown Univ., Providence, RI, USA
fYear :
2010
fDate :
13-18 June 2010
Firstpage :
1530
Lastpage :
1537
Abstract :
In this paper, we present an approach to multi-view image-based 3D reconstruction by statistically inverting the ray-tracing based image generation process. The proposed algorithm is fast, accurate and does not need any initialization. The geometric representation is a discrete volume divided into voxels, with each voxel associated with two properties: opacity (shape) and color (appearance). The problem is then formulated as inferring each voxel's most probable opacity and color through MAP estimation of the developed Ray Markov Random Fields (RayMRF). RayMRF is constructed with three kinds of cliques: the usual unary and pairwise cliques favoring connected voxel regions, and most importantly ray-cliques modelling the ray-tracing based image generation process. Each ray-clique connects the voxels that the viewing ray passes through. It provides a principled way of modeling occlusion without approximation. The inference problem involved in the MAP estimation is handled by an optimized belief propagation algorithm. One unusual structure of the proposed MRF is that each ray-clique usually involves hundreds or thousands of random variables, which seems to make the inference computationally formidable. Exploiting the special functional form of the ray-clique, we investigate the deep factorization property of the ray-clique energy and obtain a highly efficient algorithm based on general loopy belief propagation, which reduces the computational complexity from exponential to linear. Both the efficient inference algorithm and the overall system concept are new. Combining the two yields an algorithm that can invert the image generation process very quickly. 3D surface reconstruction in a 100×100×100 (i.e., 10^6) voxel space with 10 images requires roughly 3 minutes on a 3.0 GHz single-core CPU. The running time grows linearly with respect to the number of voxels and the number of images.
The speed could be further improved with a hierarchical sparse representation of the volume, such as an octree. Experiments on several standard datasets show the quality and speed of the proposed models and algorithms.
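The ray-tracing image generation process that a ray-clique models can be illustrated with a minimal sketch. It assumes binary opacity per voxel, voxels ordered front to back along one viewing ray, and a squared color difference as the cost; the function names, the cost form and the `background_cost` parameter are hypothetical choices for illustration and are not taken from the paper. The sketch only shows why a single ray can be evaluated in time linear in the number of voxels it crosses; it is not the belief propagation algorithm itself.

```python
from typing import Optional, Sequence, Tuple

Color = Tuple[float, float, float]


def first_opaque_index(opacity: Sequence[int]) -> Optional[int]:
    """Return the index of the first opaque voxel along the ray, or None.

    `opacity` lists the voxels in front-to-back order; 1 means opaque.
    """
    for i, o in enumerate(opacity):
        if o:
            return i
    return None


def ray_cost(
    opacity: Sequence[int],
    colors: Sequence[Color],
    pixel: Color,
    background_cost: float = 1.0,  # hypothetical penalty when the ray hits nothing
) -> float:
    """Illustrative cost of one ray: compare the first visible voxel to the pixel.

    All voxels behind the first opaque one are occluded and do not contribute,
    so one front-to-back pass over the ray is enough.
    """
    i = first_opaque_index(opacity)
    if i is None:
        return background_cost
    return sum((a - b) ** 2 for a, b in zip(colors[i], pixel))
```

For example, with `opacity = [0, 0, 1, 1]` only the third voxel is visible, so the fourth voxel's color has no effect on the cost for that ray.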
Keywords :
Markov processes; image colour analysis; image reconstruction; inference mechanisms; maximum likelihood estimation; ray tracing; MAP estimation; deep factorization property; general loopy belief propagation; geometric representation; image-based 3D modeling; inference problem; multiview image-based 3D reconstruction; ray Markov random fields; ray-clique energy; ray-cliques modelling; ray-tracing based image generation process; Belief propagation; Computational complexity; Image generation; Image reconstruction; Inference algorithms; Markov random fields; Random variables; Ray tracing; Rough surfaces; Shape;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
Conference_Location :
San Francisco, CA
ISSN :
1063-6919
Print_ISBN :
978-1-4244-6984-0
Type :
conf
DOI :
10.1109/CVPR.2010.5539790
Filename :
5539790