Title :
The peaking phenomenon revisited: The case with feature selection
Author :
Sima, Chao ; Dougherty, Edward R.
Author_Institution :
Comput. Biol. Div., Translational Genomics Res. Inst., Phoenix, AZ
Abstract :
For a fixed sample size, a common phenomenon is that the error of a designed classifier decreases and then increases as the number of features grows. Historically this peaking phenomenon has been studied without taking into account feature selection, which is commonplace in high-dimensional settings. This paper revisits the peaking phenomenon in the presence of feature selection. The error curves tend to fall into three categories: peaking, settling into a plateau, or falling very slowly over a long range of feature-set sizes. It can be concluded that one should be wary of applying peaking results found in the absence of feature selection to settings in which feature selection is employed.
Keywords :
biology computing; feature extraction; genetics; pattern classification; classifier error; feature selection; genomics; peaking phenomenon; Bioinformatics; Biology computing; Chaos; Computational biology; Computer aided software engineering; Covariance matrix; Distributed computing; Error analysis; Genomics; Iterative algorithms;
Conference_Titel :
Genomic Signal Processing and Statistics, 2008. GENSiPS 2008. IEEE International Workshop on
Conference_Location :
Phoenix, AZ
Print_ISBN :
978-1-4244-2371-2
Electronic_ISBN :
978-1-4244-2372-9
DOI :
10.1109/GENSIPS.2008.4555663