Title :
Integrating supervised subspace criteria with restricted Boltzmann Machine for feature extraction
Author :
Guo-Sen Xie ; Xu-Yao Zhang ; Yan-Ming Zhang ; Cheng-Lin Liu
Author_Institution :
Nat. Lab. of Pattern Recognition (NLPR), Inst. of Autom., Beijing, China
Abstract :
Restricted Boltzmann Machine (RBM) is a widely used building-block in deep neural networks. However, RBM is an unsupervised model which can not exploit the rich supervised information of data. Therefore, we consider combining the descriptive (generative) ability of RBM with the discriminative ability of supervised subspace models, i.e., Fisher linear discriminant analysis (FDA), marginal Fisher analysis (MFA), and heat kernel MFA (hkMFA). Specifically, the hidden layer of RBM is regularized by the supervised subspace criteria, and the joint learning model can then be efficiently optimized by gradient descent and graph construction (used to define the scatter matrix in the subspace models) on mini-batch data. Compared with the traditional subspace models (FDA, MFA, hkMFA), the proposed hybrid models are essentially nonlinear and can be optimized by gradient descent instead of eigenvalue decomposition. More importantly, traditional subspace models can only reduce the dimensionality (because of linear transformation), while the proposed models can also increase the dimensionality for better class discrimination. Experiments on three databases demonstrate that the proposed hybrid models outperform both RBM and their counterpart subspace models (FDA, MFA, hkMFA) consistently.
Keywords :
Boltzmann machines; eigenvalues and eigenfunctions; feature extraction; gradient methods; learning (artificial intelligence); FDA; Fisher linear discriminant analysis; RBM; databases; deep neural networks; dimensionality reduction; eigenvalue decomposition; feature extraction; gradient descent; graph construction; heat kernel MFA; hkMFA; joint learning model; marginal Fisher analysis; mini-batch data; restricted Boltzmann machine; rich supervised information; subspace models; supervised subspace criteria; Analytical models; Data models; Eigenvalues and eigenfunctions; Feature extraction; Joints; Mathematical model; Training;
Conference_Titel :
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-6627-1
DOI :
10.1109/IJCNN.2014.6889447