Title :
Optimizing multilayer perceptrons by discriminatory component analysis
Author :
Wang, Yue ; Wang, Zuyi ; Xuan, Jianhua ; Zhang, Junying ; Hoffman, Eric P. ; Clarke, Robert ; Khan, Javed
Author_Institution :
Dept. of Electr. & Comput. Eng., Virginia Polytech Inst. & State Univ., Alexandria, VA
fDate :
Sept. 29 2004-Oct. 1 2004
Abstract :
Multilayer perceptrons offer an integrated procedure for feature extraction and Bayes classification by learning the decision boundary. Its feedforward autoassociative architecture can also be used to construct subspaces in a supervised or unsupervised model [A.K. Jain et al., 2000]. On the other hand, multiclass linear discriminant analysis provides a multivariate prediction by estimating the density function. Its linear subspaces obtained by the weighted Fisher criteria under a standard finite normal mixture model retain most closely the intrinsic Bayes separability [M. Loog et al., 2001]. Here we show a two-fold connection between multilayer perceptrons and linear discriminant analysis using discriminatory component analysis. This theoretical observation immediately suggests a possible clustering-model supported optimization mechanism for multilayer perceptrons: the weights between the input and hidden layers are related to eigenvectors of the weighted Fisher scatter matrix, the number of the hidden layer neurons is justified by the corresponding significant eigenvalues, and the weights connected to the output neurons are obtained from the centers of the classes in the extracted feature subspaces
Keywords :
Bayes methods; eigenvalues and eigenfunctions; feature extraction; feedforward; learning (artificial intelligence); matrix algebra; multilayer perceptrons; optimisation; pattern classification; Bayes classification; decision boundary learning; density function estimation; discriminatory component analysis; feature extraction; feedforward autoassociative architecture; hidden layer neurons; intrinsic Bayes separability; multiclass linear discriminant analysis; multilayer perceptrons optimization; multivariate prediction; standard finite normal mixture model; weighted Fisher scatter matrix; Density functional theory; Feature extraction; Genetics; Linear discriminant analysis; Multilayer perceptrons; Neurons; Oncology; Pediatrics; Physiology; Scattering;
Conference_Titel :
Machine Learning for Signal Processing, 2004. Proceedings of the 2004 14th IEEE Signal Processing Society Workshop
Conference_Location :
Sao Luis
Print_ISBN :
0-7803-8608-4
DOI :
10.1109/MLSP.2004.1422983