DocumentCode :
3019074
Title :
3D Pose Estimation Based on Multiple Monocular Cues
Author :
Barrois, Björn ; Wöhler, Christian
Author_Institution :
DaimlerChrysler Group Res., Ulm
fYear :
2007
fDate :
17-22 June 2007
Firstpage :
1
Lastpage :
8
Abstract :
In this study we propose an integrated approach to the problem of 3D pose estimation. The main difference to the majority of known methods is the usage of complementary image information, including intensity and polarisation state of the light reflected from the object surface, edge information, and absolute depth values obtained based on a depth from defocus approach. Our method is based on the comparison of the input image to synthetic images generated by an OpenGL-based renderer using model information about the object provided by CAD data. This comparison provides an error term which is minimised by an iterative optimisation algorithm. Although all six degrees of freedom are estimated, our method requires only a monocular camera, circumventing disadvantages of multiocular camera systems such as the need for external camera calibration. Our framework is open for the inclusion of independently acquired depth data. We evaluate our method on a toy example as well as in two realistic scenarios in the domain of industrial quality inspection. Our experiments regarding complex real-world objects located at a distance of about 0.5 m to the camera show that the algorithm achieves typical accuracies of better than 1 degree for the rotation angles, 1-2 image pixels for the lateral translations, and several millimetres or about 1 percent for the object distance.
Keywords :
image resolution; iterative methods; object detection; optimisation; rendering (computer graphics); 3D pose estimation; CAD data; OpenGL-based renderer; complementary image information; edge information; image pixels; industrial quality inspection; iterative optimisation algorithm; multiocular camera systems; multiple monocular cues; object surface; synthetic images; Calibration; Cameras; Computer vision; Image generation; Iterative algorithms; Optical polarization; Reflectivity; Rendering (computer graphics); Solid modeling; Toy industry;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on
Conference_Location :
Minneapolis, MN
ISSN :
1063-6919
Print_ISBN :
1-4244-1179-3
Electronic_ISBN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2007.383352
Filename :
4270350
Link To Document :
بازگشت