Hand-Crafted Features or Machine Learnt Features? Together They Improve RGB-D Object Recognition

Author

Lu Jin ; Shenghua Gao ; Zechao Li ; Jinhui Tang

Author_Institution

Sch. of Comput. Sci. & Eng., Nanjing Univ. of Sci. & Technol., Nanjing, China

fYear

2014

fDate

10-12 Dec. 2014

Firstpage

311

Lastpage

319

Abstract

RGB-D object recognition is an important research topic in computer vision, and seeking a robust image representation is its most important subproblem. On the one hand, recently emerging deep learning methods, which learn image representations automatically by capturing the structure of the data, have demonstrated impressive performance for object recognition. On the other hand, the previously common hand-crafted features encode prior knowledge about the data. Recognizing that hand-crafted features and machine learnt features characterize different aspects of image data, we propose to use them jointly for RGB-D object recognition rather than relying on only one type of feature. Specifically, we use Convolutional Neural Networks (CNNs) to extract the machine learnt representation, and Locality-constrained Linear Coding (LLC) based spatial pyramid matching for the hand-crafted features. We evaluate the proposed approach on three publicly available RGB-D datasets. Experimental results show that our method achieves the best performance in all cases, which demonstrates its effectiveness.
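
The hand-crafted branch named in the abstract can be illustrated with a minimal sketch. It uses the standard approximated LLC solution (each local descriptor is coded over its k nearest codebook atoms under a sum-to-one constraint), followed by max pooling over a spatial pyramid. Everything else here is an assumption, not the authors' implementation: the function names, the pyramid levels, the regularisation value, and in particular the fusion step. The abstract does not say how the two feature types are combined, so plain concatenation of L2-normalised vectors stands in for it, and the CNN feature is treated as an externally supplied vector.

```python
import numpy as np


def llc_encode(descriptors, codebook, k=5, reg=1e-4):
    """Approximated LLC: code each descriptor over its k nearest atoms."""
    n, m = descriptors.shape[0], codebook.shape[0]
    codes = np.zeros((n, m))
    # Squared Euclidean distance from every descriptor to every atom.
    d2 = ((descriptors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    nearest = np.argsort(d2, axis=1)[:, :k]
    for i in range(n):
        idx = nearest[i]
        z = codebook[idx] - descriptors[i]      # shift atoms to the descriptor
        cov = z @ z.T                           # local covariance, k x k
        cov += reg * max(np.trace(cov), 1e-12) * np.eye(k)  # keep it invertible
        w = np.linalg.solve(cov, np.ones(k))
        codes[i, idx] = w / w.sum()             # enforce the sum-to-one constraint
    return codes


def spatial_pyramid_max_pool(codes, xy, levels=(1, 2, 4)):
    """Max-pool codes in each pyramid cell; xy holds positions in [0, 1)."""
    pooled = []
    for g in levels:
        cell = np.minimum((xy * g).astype(int), g - 1)  # cell index per axis
        flat = cell[:, 1] * g + cell[:, 0]
        for c in range(g * g):
            mask = flat == c
            if mask.any():
                pooled.append(codes[mask].max(axis=0))
            else:
                pooled.append(np.zeros(codes.shape[1]))  # empty cell
    v = np.concatenate(pooled)
    return v / (np.linalg.norm(v) + 1e-12)


def fuse(cnn_feature, llc_feature):
    """Assumed fusion: L2-normalise each vector, then concatenate."""
    a = cnn_feature / (np.linalg.norm(cnn_feature) + 1e-12)
    b = llc_feature / (np.linalg.norm(llc_feature) + 1e-12)
    return np.concatenate([a, b])
```

In use, `descriptors` would be local descriptors (for example dense SIFT) extracted from the RGB or depth image, `xy` their normalised image positions, and `codebook` a dictionary learnt beforehand (for example by k-means). The fused vector would then be passed to a classifier such as a linear SVM.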

Keywords

computer vision; feature extraction; feedforward neural nets; image coding; image colour analysis; image matching; image representation; learning (artificial intelligence); object recognition; CNN; LLC based spatial pyramid matching; RGB-D object recognition improvement; convolutional neural networks; data structure; deep learning methods; hand-crafted features; locality-constrained linear coding based spatial pyramid matching; machine learnt features; machine learnt representation extraction; robust image representation; Convolutional codes; Encoding; Feature extraction; Image coding; Image representation; Kernel; Object recognition; CNNs; Hand-crafted feature; LLC; Machine learnt features; RGB-D object recognition

fLanguage

English

Publisher

IEEE

Conference_Titel

Multimedia (ISM), 2014 IEEE International Symposium on

Conference_Location

Taichung

Print_ISBN

978-1-4799-4312-8

Type

conf

DOI

10.1109/ISM.2014.56

Filename

7033044