DocumentCode
2864666
Title
Stability of feature selection algorithms
Author
Kalousis, Alexandros ; Prados, Julien ; Hilario, Melanie
Author_Institution
Dept. of Comput. Sci., Geneva Univ., Switzerland
fYear
2005
fDate
27-30 Nov. 2005
Abstract
With the proliferation of extremely high-dimensional data, feature selection algorithms have become indispensable components of the learning process. Strangely, despite extensive work on the stability of learning algorithms, the stability of feature selection algorithms has been relatively neglected. This study is an attempt to fill that gap by quantifying the sensitivity of feature selection algorithms to variations in the training set. We assess the stability of feature selection algorithms based on the stability of the feature preferences that they express in the form of weights-scores, ranks, or a selected feature subset. We examine a number of measures to quantify the stability of feature preferences and propose an empirical way to estimate them. We perform a series of experiments with several feature selection algorithms on a set of proteomics datasets. The experiments allow us to explore the merits of each stability measure and create stability profiles of the feature selection algorithms. Finally we show how stability profiles can support the choice of a feature selection algorithm.
Keywords
learning (artificial intelligence); pattern classification; feature selection algorithm; learning algorithm stability; stability measure; stability profile; Classification algorithms; Computer science; Data mining; Error analysis; Predictive models; Probability distribution; Proteomics; Sampling methods; Stability analysis; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, Fifth IEEE International Conference on
ISSN
1550-4786
Print_ISBN
0-7695-2278-5
Type
conf
DOI
10.1109/ICDM.2005.135
Filename
1565682
Link To Document