DocumentCode :
2864666
Title :
Stability of feature selection algorithms
Author :
Kalousis, Alexandros ; Prados, Julien ; Hilario, Melanie
Author_Institution :
Dept. of Comput. Sci., Geneva Univ., Switzerland
fYear :
2005
fDate :
27-30 Nov. 2005
Abstract :
With the proliferation of extremely high-dimensional data, feature selection algorithms have become indispensable components of the learning process. Strangely, despite extensive work on the stability of learning algorithms, the stability of feature selection algorithms has been relatively neglected. This study is an attempt to fill that gap by quantifying the sensitivity of feature selection algorithms to variations in the training set. We assess the stability of feature selection algorithms based on the stability of the feature preferences that they express in the form of weights-scores, ranks, or a selected feature subset. We examine a number of measures to quantify the stability of feature preferences and propose an empirical way to estimate them. We perform a series of experiments with several feature selection algorithms on a set of proteomics datasets. The experiments allow us to explore the merits of each stability measure and create stability profiles of the feature selection algorithms. Finally we show how stability profiles can support the choice of a feature selection algorithm.
Keywords :
learning (artificial intelligence); pattern classification; feature selection algorithm; learning algorithm stability; stability measure; stability profile; Classification algorithms; Computer science; Data mining; Error analysis; Predictive models; Probability distribution; Proteomics; Sampling methods; Stability analysis; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, Fifth IEEE International Conference on
ISSN :
1550-4786
Print_ISBN :
0-7695-2278-5
Type :
conf
DOI :
10.1109/ICDM.2005.135
Filename :
1565682
Link To Document :
بازگشت