DocumentCode :
2340592
Title :
Feature selection based on bootstrapping
Author :
Diaz-Diaz, Norberto ; Aguilar-Ruiz, Jesus S. ; Nepomuceno, Juan A. ; García, Jorge
Author_Institution :
Bioinformatics Group Seville, Seville Univ.
fYear :
0
fDate :
0-0 0
Abstract :
The results of feature selection methods have a great influence on the success of data mining processes, especially when the data sets have high dimensionality. In order to find the optimal result from feature selection methods, we should check each possible subset of features to obtain the precision on classification, i.e., an exhaustive search through the search space. However, it is an unfeasible task due to its computational complexity. In this paper, we propose a novel method of feature selection based on bootstrapping techniques. Our approach shows that it is not necessary to try every subset of features, but only a very small subset of combinations to achieve the same performance as the exhaustive approach. The experiments have been carried out using very high-dimensional datasets (thousands of features) and they show that it is possible to maintain the precision at the same time that the complexity is reduced substantially
Keywords :
computational complexity; data mining; feature extraction; pattern classification; bootstrapping technique; computational complexity; data mining; feature selection; Accuracy; Cost function; Data mining; Error analysis; Filters; Postal services; Sampling methods;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence Methods and Applications, 2005 ICSC Congress on
Conference_Location :
Istanbul
Print_ISBN :
1-4244-0020-1
Type :
conf
DOI :
10.1109/CIMA.2005.1662338
Filename :
1662338
Link To Document :
بازگشت