• DocumentCode
    259509
  • Title

    Hand-Crafted Features or Machine Learnt Features? Together They Improve RGB-D Object Recognition

  • Author

    Lu Jin ; Shenghua Gao ; Zechao Li ; Jinhui Tang

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Nanjing Univ. of Sci. & Technol., Nanjing, China
  • fYear
    2014
  • fDate
    10-12 Dec. 2014
  • Firstpage
    311
  • Lastpage
    319
  • Abstract
    RGB-D object recognition is an important research topic in computer version, and seeking a robust image representation is the most important sub problem for RGB-D object recognition. On the one hand, the recently emerging deep learning methods, which learns image representations automatically by capturing the data structure, have demonstrated the impressive performance for object recognition. On the other hand, the previously commonly used hand-crafted features also encodes the prior knowledge about the data. By realizing that the hand-crafted features and machine learnt features actually characterize the different aspects of image data, rather than only using one type of feature, we propose to jointly use the machine learnt features and hand-crafted features for RGB-D object recognition. Specifically, we use the Convolution Neural Networks (CNNs) to extract the machine learnt representation, and use Locality-constrained Linear Coding (LLC) based spatial pyramid matching for hand-crafted features. We evaluated our proposed approach on three publicly available RGB-D datasets. Experimental results show that our method achieves the best performance under all the cases, which demonstrates the effectiveness of our method.
  • Keywords
    computer vision; feature extraction; feedforward neural nets; image coding; image colour analysis; image matching; image representation; learning (artificial intelligence); object recognition; CNN; LLC based spatial pyramid matching; RGB-D object recognition improvement; computer version; convolution neural networks; data structure; deep learning methods; hand-crafted features; locality-constrained linear coding based spatial pyramid matching; machine learnt features; machine learnt representation extraction; robust image representation; Convolutional codes; Encoding; Feature extraction; Image coding; Image representation; Kernel; Object recognition; CNNs; Hand-crafted feature; LLC; Machine learnt features; RGB-D object recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia (ISM), 2014 IEEE International Symposium on
  • Conference_Location
    Taichung
  • Print_ISBN
    978-1-4799-4312-8
  • Type

    conf

  • DOI
    10.1109/ISM.2014.56
  • Filename
    7033044