• DocumentCode
    1444748
  • Title

    Feature Extraction With Deep Neural Networks by a Generalized Discriminant Analysis

  • Author

    Stuhlsatz, A. ; Lippel, J. ; Zielke, T.

  • Author_Institution
    R&D Div., SMS Siemag AG, Dusseldorf, Germany
  • Volume
    23
  • Issue
    4
  • fYear
    2012
  • fDate
    4/1/2012 12:00:00 AM
  • Firstpage
    596
  • Lastpage
    608
  • Abstract
    We present an approach to feature extraction that is a generalization of the classical linear discriminant analysis (LDA) on the basis of deep neural networks (DNNs). As for LDA, discriminative features generated from independent Gaussian class conditionals are assumed. This modeling has the advantages that the intrinsic dimensionality of the feature space is bounded by the number of classes and that the optimal discriminant function is linear. Unfortunately, linear transformations are insufficient to extract optimal discriminative features from arbitrarily distributed raw measurements. The generalized discriminant analysis (GerDA) proposed in this paper uses nonlinear transformations that are learnt by DNNs in a semisupervised fashion. We show that the feature extraction based on our approach displays excellent performance on real-world recognition and detection tasks, such as handwritten digit recognition and face detection. In a series of experiments, we evaluate GerDA features with respect to dimensionality reduction, visualization, classification, and detection. Moreover, we show that GerDA DNNs can preprocess truly high-dimensional input data to low-dimensional representations that facilitate accurate predictions even if simple linear predictors or measures of similarity are used.
  • Keywords
    Gaussian distribution; feature extraction; neural nets; LDA; classical linear discriminant analysis; deep neural networks; face detection; feature space intrinsic dimensionality; generalized discriminant analysis; handwritten digit recognition; independent Gaussian class conditionals; nonlinear transformations; optimal discriminant function; optimal discriminative feature extraction; Covariance matrix; Feature extraction; Neural networks; Nickel; Optimization; Training; Visualization; Deep neural networks; dimensionality reduction; discriminant analysis; face detection; feature extraction; restricted Boltzmann machines; sensor fusion;
  • fLanguage
    English
  • Journal_Title
    Neural Networks and Learning Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2162-237X
  • Type

    jour

  • DOI
    10.1109/TNNLS.2012.2183645
  • Filename
    6149591