• DocumentCode
    2719059
  • Title

    Beyond spatial pyramids: Receptive field learning for pooled image features

  • Author

    Yangqing Jia ; Chang Huang ; Darrell, Trevor

  • Author_Institution
    UC Berkeley EECS & ICSI, Berkeley, CA, USA
  • fYear
    2012
  • fDate
    16-21 June 2012
  • Firstpage
    3370
  • Lastpage
    3377
  • Abstract
    In this paper we examine the effect of receptive field designs on classification accuracy in the commonly adopted pipeline of image classification. While existing algorithms usually use manually defined spatial regions for pooling, we show that learning more adaptive receptive fields increases performance even with a significantly smaller codebook size at the coding layer. To learn the optimal pooling parameters, we adopt the idea of over-completeness by starting with a large number of receptive field candidates, and train a classifier with structured sparsity to only use a sparse subset of all the features. An efficient algorithm based on incremental feature selection and retraining is proposed for fast learning. With this method, we achieve the best published performance on the CIFAR-10 dataset, using a much lower dimensional feature space than previous methods.
  • Keywords
    feature extraction; image classification; image coding; learning (artificial intelligence); CIFAR-10 dataset; adaptive receptive field learning; codebook size; image classification; incremental feature selection; pooled image features; pooling parameter; receptive field design; retraining; spatial pyramids; Accuracy; Dictionaries; Encoding; Image coding; Pipelines; Training data; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
  • Conference_Location
    Providence, RI
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4673-1226-4
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2012.6248076
  • Filename
    6248076