DocumentCode
2719059
Title
Beyond spatial pyramids: Receptive field learning for pooled image features
Author
Yangqing Jia ; Chang Huang ; Darrell, Trevor
Author_Institution
UC Berkeley EECS & ICSI, Berkeley, CA, USA
fYear
2012
fDate
16-21 June 2012
Firstpage
3370
Lastpage
3377
Abstract
In this paper we examine the effect of receptive field designs on classification accuracy in the commonly adopted pipeline of image classification. While existing algorithms usually use manually defined spatial regions for pooling, we show that learning more adaptive receptive fields increases performance even with a significantly smaller codebook size at the coding layer. To learn the optimal pooling parameters, we adopt the idea of over-completeness by starting with a large number of receptive field candidates, and train a classifier with structured sparsity to only use a sparse subset of all the features. An efficient algorithm based on incremental feature selection and retraining is proposed for fast learning. With this method, we achieve the best published performance on the CIFAR-10 dataset, using a much lower dimensional feature space than previous methods.
Keywords
feature extraction; image classification; image coding; learning (artificial intelligence); CIFAR-10 dataset; adaptive receptive field learning; codebook size; image classification; incremental feature selection; pooled image features; pooling parameter; receptive field design; retraining; spatial pyramids; Accuracy; Dictionaries; Encoding; Image coding; Pipelines; Training data; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
Conference_Location
Providence, RI
ISSN
1063-6919
Print_ISBN
978-1-4673-1226-4
Electronic_ISBN
1063-6919
Type
conf
DOI
10.1109/CVPR.2012.6248076
Filename
6248076
Link To Document