Title :
A two-phase feature selection method using both filter and wrapper
Author :
Yuan, Huang ; Tseng, Shian-Shyong ; Gangshan, Wu ; Fuyan, Zhang
Author_Institution :
Dept. of Comput. Sci. & Technol., Nanjing Univ., China
fDate :
6/21/1905 12:00:00 AM
Abstract :
Feature selection is an integral step of the data mining process to find an optimal subset of features. After examining the problems with both the filter and the wrapper approach to feature selection, we propose a two-phase (filter and wrapper) feature selection algorithm that can take advantage of both approaches. It begins by running GFSIC (Genetic Feature Selection with Inconsistency Criterion), a filter approach, to remove irrelevant features, then it runs SBFCV (Sensitivity-Based Feature selection with v-fold Cross-Validation), a wrapper approach, to remove redundant or useless features. Analysis and experimental studies show the effectiveness and scalability of the proposed algorithm. The generalization of the neural network is improved when the algorithm is used to pre-process the training data by eliminating irrelevant and useless features from the neural network´s consideration
Keywords :
data mining; feature extraction; feedforward neural nets; generalisation (artificial intelligence); genetic algorithms; learning (artificial intelligence); redundancy; GFSIC; SBFCV; data mining; feedforward neural network; filter approach; genetic feature selection; inconsistency criterion; irrelevant feature removal; neural network generalization; optimal feature subset; redundant feature removal; scalability; sensitivity-based feature selection; training data preprocessing; two-phase feature selection method; useless feature removal; v-fold cross-validation; wrapper approach; Algorithm design and analysis; Computer science; Data mining; Filters; Genetic algorithms; Information science; Laboratories; Neural networks; Neurons; Scalability;
Conference_Titel :
Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on
Conference_Location :
Tokyo
Print_ISBN :
0-7803-5731-0
DOI :
10.1109/ICSMC.1999.825221