DocumentCode :
3605234
Title :
PCANet: A Simple Deep Learning Baseline for Image Classification?
Author :
Tsung-Han Chan ; Kui Jia ; Shenghua Gao ; Jiwen Lu ; Zinan Zeng ; Yi Ma
Author_Institution :
MediaTek Inc., Hsinchu, Taiwan
Volume :
24
Issue :
12
fYear :
2015
Firstpage :
5017
Lastpage :
5032
Abstract :
In this paper, we propose a very simple deep learning network for image classification that is based on very basic data processing components: 1) cascaded principal component analysis (PCA); 2) binary hashing; and 3) blockwise histograms. In the proposed architecture, the PCA is employed to learn multistage filter banks. This is followed by simple binary hashing and block histograms for indexing and pooling. This architecture is thus called the PCA network (PCANet) and can be extremely easily and efficiently designed and learned. For comparison and to provide a better understanding, we also introduce and study two simple variations of PCANet: 1) RandNet and 2) LDANet. They share the same topology as PCANet, but their cascaded filters are either randomly selected or learned from linear discriminant analysis. We have extensively tested these basic networks on many benchmark visual data sets for different tasks, including Labeled Faces in the Wild (LFW) for face verification; the MultiPIE, Extended Yale B, AR, Facial Recognition Technology (FERET) data sets for face recognition; and MNIST for hand-written digit recognition. Surprisingly, for all tasks, such a seemingly naive PCANet model is on par with the state-of-the-art features either prefixed, highly hand-crafted, or carefully learned [by deep neural networks (DNNs)]. Even more surprisingly, the model sets new records for many classification tasks on the Extended Yale B, AR, and FERET data sets and on MNIST variations. Additional experiments on other public data sets also demonstrate the potential of PCANet to serve as a simple but highly competitive baseline for texture classification and object recognition.
Keywords :
channel bank filters; face recognition; handwriting recognition; image classification; image texture; learning (artificial intelligence); neural nets; object recognition; principal component analysis; AR; DNN; FERET data sets; LDANet; LFW; MNIST; PCA network; PCANet; RandNet; binary hashing; blockwise histograms; cascaded principal component analysis; data processing components; deep learning baseline; deep neural networks; extended Yale B; face verification; facial recognition technology; handwritten digit recognition; image classification; labeled faces; linear discriminant analysis; multiPIE; multistage filter banks; object recognition; public data sets; texture classification; visual data sets; wild; Face; Face recognition; Feature extraction; Histograms; Machine learning; Principal component analysis; Training; Convolution Neural Network; Convolution neural network; Deep Learning; Face Recognition; Handwritten Digit Recognition; LDA Network; LDA network; Object Classification; PCA Network; PCA network; Random Network; deep learning; face recognition; handwritten digit recognition; object classification; random network;
fLanguage :
English
Journal_Title :
Image Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1057-7149
Type :
jour
DOI :
10.1109/TIP.2015.2475625
Filename :
7234886
Link To Document :
بازگشت