• DocumentCode
    2864666
  • Title

    Stability of feature selection algorithms

  • Author

    Kalousis, Alexandros ; Prados, Julien ; Hilario, Melanie

  • Author_Institution
    Dept. of Comput. Sci., Geneva Univ., Switzerland
  • fYear
    2005
  • fDate
    27-30 Nov. 2005
  • Abstract
    With the proliferation of extremely high-dimensional data, feature selection algorithms have become indispensable components of the learning process. Strangely, despite extensive work on the stability of learning algorithms, the stability of feature selection algorithms has been relatively neglected. This study is an attempt to fill that gap by quantifying the sensitivity of feature selection algorithms to variations in the training set. We assess the stability of feature selection algorithms based on the stability of the feature preferences that they express in the form of weights-scores, ranks, or a selected feature subset. We examine a number of measures to quantify the stability of feature preferences and propose an empirical way to estimate them. We perform a series of experiments with several feature selection algorithms on a set of proteomics datasets. The experiments allow us to explore the merits of each stability measure and create stability profiles of the feature selection algorithms. Finally we show how stability profiles can support the choice of a feature selection algorithm.
  • Keywords
    learning (artificial intelligence); pattern classification; feature selection algorithm; learning algorithm stability; stability measure; stability profile; Classification algorithms; Computer science; Data mining; Error analysis; Predictive models; Probability distribution; Proteomics; Sampling methods; Stability analysis; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, Fifth IEEE International Conference on
  • ISSN
    1550-4786
  • Print_ISBN
    0-7695-2278-5
  • Type

    conf

  • DOI
    10.1109/ICDM.2005.135
  • Filename
    1565682