• DocumentCode
    2508154
  • Title

    Three-layer Spatial Sparse Coding for Image Classification

  • Author

    Dai, Dengxin ; Yang, Wen ; Wu, Tianfu

  • Author_Institution
    Signal Process. Lab., Wuhan Univ., Wuhan, China
  • fYear
    2010
  • fDate
    23-26 Aug. 2010
  • Firstpage
    613
  • Lastpage
    616
  • Abstract
    In this paper, we propose a three-layer spatial sparse coding (TSSC) for image classification, aiming at three objectives: naturally recognizing image categories without learning phase, naturally involving spatial configurations of images, and naturally counteracting the intra-class variances. The method begins by representing the test images in a spatial pyramid as the to-be-recovered signals, and taking all sampled image patches at multiple scales from the labeled images as the bases. Then, three sets of coefficients are involved into the cardinal sparse coding to get the TSSC, one to penalize spatial inconsistencies of the pyramid cells and the corresponding selected bases, one to guarantee the sparsity of selected images, and the other to guarantee the sparsity of selected categories. Finally, the test images are classified according to a simple image-to-category similarity defined on the coding coefficients. In experiments, we test our method on two publicly available datasets and achieve significantly more accurate results than the conventional sparse coding with only a modest increase in computational complexity.
  • Keywords
    image classification; image coding; TSSC; image classification; image-to-category similarity; spatial configuration; spatial pyramid; three-layer spatial sparse coding; Artificial neural networks; Encoding; Image coding; Image reconstruction; Minimization; Pixel; Visualization; image classification; sparse coding; three-layer spatial sparse coding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2010 20th International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-7542-1
  • Type

    conf

  • DOI
    10.1109/ICPR.2010.155
  • Filename
    5597454