DocumentCode :
1991722
Title :
The peaking phenomenon revisited: The case with feature selection
Author :
Sima, Chao ; Dougherty, Edward R.
Author_Institution :
Comput. Biol. Div., Translational Genomics Res. Inst., Phoenix, AZ
fYear :
2008
fDate :
8-10 June 2008
Firstpage :
1
Lastpage :
2
Abstract :
For a fixed sample size, a common phenomenon is that the error of a designed classifier decreases and then increases as the number of features grows. Historically this peaking phenomenon has been studied without taking into account feature selection, which is commonplace in high-dimensional settings. This paper revisits the peaking phenomenon in the presence of feature selection. The error curves tend to fall into three categories: peaking, settling into a plateau, or falling very slowly over a long range of feature-set sizes. It can be concluded that one should be wary of applying peaking results found in the absence of feature selection to settings in which feature selection is employed.
Keywords :
biology computing; feature extraction; genetics; pattern classification; classifier error; feature selection; genomics; peaking phenomenon; Bioinformatics; Biology computing; Chaos; Computational biology; Computer aided software engineering; Covariance matrix; Distributed computing; Error analysis; Genomics; Iterative algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Genomic Signal Processing and Statistics, 2008. GENSiPS 2008. IEEE International Workshop on
Conference_Location :
Phoenix, AZ
Print_ISBN :
978-1-4244-2371-2
Electronic_ISBN :
978-1-4244-2372-9
Type :
conf
DOI :
10.1109/GENSIPS.2008.4555663
Filename :
4555663
Link To Document :
بازگشت