DocumentCode :
2920917
Title :
Learning image representations from the pixel level via hierarchical sparse coding
Author :
Yu, Kai ; Lin, Yuanqing ; Lafferty, John
Author_Institution :
NEC Labs. America, Cupertino, CA, USA
fYear :
2011
fDate :
20-25 June 2011
Firstpage :
1713
Lastpage :
1720
Abstract :
We present a method for learning image representations using a two-layer sparse coding scheme at the pixel level. The first layer encodes local patches of an image. After pooling within local regions, the first layer codes are then passed to the second layer, which jointly encodes signals from the region. Unlike traditional sparse coding methods that encode local patches independently, this approach accounts for high-order dependency among patterns in a local image neighborhood. We develop algorithms for data encoding and codebook learning, and show in experiments that the method leads to more invariant and discriminative image representations. The algorithm gives excellent results for hand-written digit recognition on MNIST and object recognition on the Caltech101 benchmark. This marks the first time that such accuracies have been achieved using automatically learned features from the pixel level, rather than using hand-designed descriptors.
Keywords :
feature extraction; handwritten character recognition; image coding; image representation; object recognition; CaltechlOl benchmark; MNIST; codebook learning; data encoding; hand-designed descriptors; hand-written digit recognition; high-order dependency; image representation learning; local image neighborhood; object recognition; two-layer sparse coding scheme; Convolution; Encoding; Image representation; Optimization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on
Conference_Location :
Providence, RI
ISSN :
1063-6919
Print_ISBN :
978-1-4577-0394-2
Type :
conf
DOI :
10.1109/CVPR.2011.5995732
Filename :
5995732
Link To Document :
بازگشت