DocumentCode :
647429
Title :
Real-time RGB-D camera relocalization
Author :
Glocker, Ben ; Izadi, Shahram ; Shotton, Jamie ; Criminisi, Antonio
Author_Institution :
Microsoft Res., Cambridge, UK
fYear :
2013
fDate :
1-4 Oct. 2013
Firstpage :
173
Lastpage :
179
Abstract :
We introduce an efficient camera relocalization approach which can be easily integrated into real-time 3D reconstruction methods, such as KinectFusion. Our approach makes use of compact encoding of whole image frames which enables both online harvesting of keyframes in tracking mode, and fast retrieval of pose proposals when tracking is lost. The encoding scheme is based on randomized ferns and simple binary feature tests. Each fern generates a small block code, and the concatenation of codes yields a compact representation of each camera frame. Based on those representations we introduce an efficient frame dissimilarity measure which is defined via the block-wise hamming distance (BlockHD). We illustrate how BlockHDs between a query frame and a large set of keyframes can be simultaneously evaluated by traversing the nodes of the ferns and counting image co-occurrences in corresponding code tables. In tracking mode, this mechanism allows us to consider every frame/pose pair as a potential keyframe. A new keyframe is added only if it is sufficiently dissimilar from all previously stored keyframes. For tracking recovery, camera poses are retrieved that correspond to the keyframes with smallest BlockHDs. The pose proposals are then used to reinitialize the tracking algorithm. Harvesting of keyframes and pose retrieval are computationally efficient with only small impact on the run-time performance of the 3D reconstruction. Integrating our relocalization method into KinectFusion allows seamless continuation of mapping even when tracking is frequently lost. Additionally, we demonstrate how marker-free augmented reality, in particular, can benefit from this integration by enabling a smoother and continuous AR experience.
Keywords :
augmented reality; cameras; concatenated codes; image reconstruction; image retrieval; object tracking; pose estimation; real-time systems; BlockHD; KinectFusion; binary feature tests; block code; block-wise hamming distance; code concatenation; code tables; compact encoding; compact representation; encoding scheme; frame dissimilarity measure; image cooccurrence; image frames; marker-free augmented reality; online harvesting; pose proposal retrieval; pose retrieval; query frame; randomized ferns; real-time 3D reconstruction method; real-time RGB-D camera relocalization; stored keyframes; tracking mode; tracking recovery; Cameras; Encoding; Iterative closest point algorithm; Pipelines; Proposals; Real-time systems; Three-dimensional displays;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Mixed and Augmented Reality (ISMAR), 2013 IEEE International Symposium on
Conference_Location :
Adelaide, SA
Type :
conf
DOI :
10.1109/ISMAR.2013.6671777
Filename :
6671777
Link To Document :
بازگشت