Title :
Beyond spatial pyramids: Receptive field learning for pooled image features
Author :
Yangqing Jia ; Chang Huang ; Darrell, Trevor
Author_Institution :
UC Berkeley EECS & ICSI, Berkeley, CA, USA
Abstract :
In this paper we examine the effect of receptive field designs on classification accuracy in the commonly adopted pipeline of image classification. While existing algorithms usually use manually defined spatial regions for pooling, we show that learning more adaptive receptive fields increases performance even with a significantly smaller codebook size at the coding layer. To learn the optimal pooling parameters, we adopt the idea of over-completeness by starting with a large number of receptive field candidates, and train a classifier with structured sparsity to only use a sparse subset of all the features. An efficient algorithm based on incremental feature selection and retraining is proposed for fast learning. With this method, we achieve the best published performance on the CIFAR-10 dataset, using a much lower dimensional feature space than previous methods.
Keywords :
feature extraction; image classification; image coding; learning (artificial intelligence); CIFAR-10 dataset; adaptive receptive field learning; codebook size; image classification; incremental feature selection; pooled image features; pooling parameter; receptive field design; retraining; spatial pyramids; Accuracy; Dictionaries; Encoding; Image coding; Pipelines; Training data; Vectors;
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
Conference_Location :
Providence, RI
Print_ISBN :
978-1-4673-1226-4
Electronic_ISBN :
1063-6919
DOI :
10.1109/CVPR.2012.6248076