مرکز منطقه ای اطلاع رساني علوم و فناوري - A multimodal approach for image de-fencing and depth inpainting

DocumentCode :

1864746

Title :

A multimodal approach for image de-fencing and depth inpainting

Author :

Jonna, Sankaraganesh ; Voleti, Vikram S. ; Sahay, Rajiv R. ; Kankanhalli, Mohan S.

Author_Institution :

Dept. of Electr. Eng., IIT Kharagpur, Kharagpur, India

fYear :

2015

fDate :

4-7 Jan. 2015

Firstpage :

Lastpage :

Abstract :

Low cost RGB-D sensors such as the Microsoft Kinect have enabled the use of depth data along with color images. In this work, we propose a multi-modal approach to address the problem of removal of fences/occlusions from images captured using a Kinect camera. We also perform depth completion by fusing data from multiple recorded depth maps affected by occlusions. The availability of aligned image and depth data from Kinect aids us in the detection of the fence locations. However, accurate estimation of the relative shifts between the captured color frames is necessary. Initially, for the case of static scene elements with simple relative motion between the camera and the objects, we propose the use of affine scale-invariant feature transform descriptor (ASIFT) to compute the relative global displacements. We also address the scenario wherein the relative motion between the frames may not be global using the depth map obtained by Kinect. For such a scenario involving complex motion of scene pixels, we use a recently proposed robust optical flow technique. We show results for challenging real-world data wherein the scene is dynamic. The inverse ill-posed problems of estimation of the de-fenced image and the inpainted depth map are solved using an optimization-based framework. Specifically, we model the unoccluded image and the completed depth map as two distinct Markov random fields, respectively, and obtain their maximum a-posteriori estimates using loopy belief propagation.

Keywords :

Markov processes; affine transforms; image capture; image colour analysis; image fusion; image motion analysis; image sensors; image sequences; maximum likelihood estimation; optimisation; random processes; ASIFT; Markov random fields; Microsoft Kinect camera; affine scale-invariant feature transform descriptor; color frames; color images; data fusion; depth data; depth inpainting; fence location detection; image data; image defencing; loopy belief propagation; low cost RGB-D sensors; maximum a-posteriori estimates; multimodal approach; optimization-based framework; recorded depth maps; robust optical flow technique; scene pixels; static scene elements; Cameras; Image color analysis; Image reconstruction; Optical imaging; Optical sensors; Videos; Belief propagation; Image de-fencing; Inpainting; Kinect; Markov random field; RGB-D data;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Advances in Pattern Recognition (ICAPR), 2015 Eighth International Conference on

Conference_Location :

Kolkata

Type :

conf

DOI :

10.1109/ICAPR.2015.7050696

Filename :

7050696

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1864746