• DocumentCode
    3119900
  • Title

    Principal discriminants analysis for small-sample-size problems: application to chemical sensing

  • Author

    Wang, M. ; Perera, A. ; Gutierrez-Osuna, R.

  • Author_Institution
    Dept. of Comput. Sci., Texas A&M Univ., College Station, TX, USA
  • fYear
    2004
  • fDate
    24-27 Oct. 2004
  • Firstpage
    591
  • Abstract
    Two dimensionality reduction techniques are widely used to analyze data from chemical sensor arrays: Fisher´s linear discriminants analysis (LDA) and principal components analysis (PCA). LDA finds the directions of maximum discrimination in classification problems, but has a tendency to overfit when the ratio of training samples to dimensionality is low, as is commonly the case in chemical sensor array problems. PCA is more robust to overfitting but, being a variance model, fails to capture discriminatory information in low-variance sensors. In this article we propose a hybrid model, termed principal discriminants analysis (PDA), which incorporates both LDA and PCA criteria by means of a regularization parameter. The model is characterized on a synthetic dataset and validated with experimental data from an array of 15 metal-oxide sensors exposed to five varieties of roasted coffee beans. Our results show that PDA provides higher predictive accuracy than LDA or PCA alone. In addition, the model is able to find a trade-off between discriminant- and variance-based projections according to where information is located in the distribution of the data.
  • Keywords
    array signal processing; chemical sensors; gas sensors; principal component analysis; LDA; PCA; chemical sensing; chemical sensor arrays; classification problems; data distribution; dimensionality reduction techniques; discriminant-based projections; discriminatory information; linear discriminants analysis; low-variance sensors; maximum discrimination directions; metal-oxide sensors; predictive accuracy; principal components analysis; principal discriminants analysis; regularization parameter; roasted coffee beans; small-sample-size problems; synthetic dataset characterized model; training samples; variance model; variance-based projections; Application software; Chemical analysis; Chemical sensors; Data analysis; Eigenvalues and eigenfunctions; Linear discriminant analysis; Personal digital assistants; Principal component analysis; Sensor arrays; Sensor phenomena and characterization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Sensors, 2004. Proceedings of IEEE
  • Print_ISBN
    0-7803-8692-2
  • Type

    conf

  • DOI
    10.1109/ICSENS.2004.1426234
  • Filename
    1426234