DocumentCode
716341
Title
RGB-D object recognition and pose estimation based on pre-trained convolutional neural network features
Author
Schwarz, Max ; Schulz, Hannes ; Behnke, Sven
Author_Institution
Comput. Sci. Inst. VI, Rheinische Friedrich-Wilhelms-Univ. Bonn, Bonn, Germany
fYear
2015
fDate
26-30 May 2015
Firstpage
1329
Lastpage
1335
Abstract
Object recognition and pose estimation from RGB-D images are important tasks for manipulation robots which can be learned from examples. Creating and annotating datasets for learning is expensive, however. We address this problem with transfer learning from deep convolutional neural networks (CNN) that are pre-trained for image categorization and provide a rich, semantically meaningful feature set. We incorporate depth information, which the CNN was not trained with, by rendering objects from a canonical perspective and colorizing the depth channel according to distance from the object center. We evaluate our approach on the Washington RGB-D Objects dataset, where we find that the generated feature set naturally separates classes and instances well and retains pose manifolds. We outperform state-of-the-art on a number of subtasks and show that our approach can yield superior results when only little training data is available.
Keywords
learning (artificial intelligence); manipulators; neural nets; object recognition; pose estimation; visual databases; CNN; RGB-D images; Washington RGB-D objects dataset; canonical perspective; convolutional neural network features; depth channel; image categorization; manipulation robots; object center; object recognition; pose estimation; pose manifolds; transfer learning; Accuracy; Estimation; Feature extraction; Image color analysis; Pipelines; Support vector machines; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Robotics and Automation (ICRA), 2015 IEEE International Conference on
Conference_Location
Seattle, WA
Type
conf
DOI
10.1109/ICRA.2015.7139363
Filename
7139363
Link To Document