DocumentCode
2441964
Title
Feature selection increases cross-validation imprecision
Author
Xiao, Yufei ; Hua, Jianping ; Dougherty, Edward R.
Author_Institution
Dept. of Electr. Eng., Texas A&M Univ., College Station, TX
fYear
2006
fDate
28-30 May 2006
Firstpage
17
Lastpage
18
Abstract
Even without feature selection, cross-validation error estimation is problematic for small samples owing to the high variance of the deviation distribution describing the difference between the estimated and true errors. This paper investigates the increased loss of cross-validation precision owing to feature selection by comparing deviation distributions and introducing two variation-based measures to quantify the further degradation in performance.
Keywords
biology computing; estimation theory; feature extraction; genetics; sampling methods; statistical distributions; cross-validation error estimation; cross-validation imprecision; feature selection; genomics; high variance deviation distribution; Bioinformatics; Degradation; Distributed computing; Error analysis; Gaussian distribution; Genomics; Linear discriminant analysis; Loss measurement; Performance loss; Signal processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Genomic Signal Processing and Statistics, 2006. GENSIPS '06. IEEE International Workshop on
Conference_Location
College Station, TX
Print_ISBN
1-4244-0384-7
Electronic_ISBN
1-4244-0385-5
Type
conf
DOI
10.1109/GENSIPS.2006.353134
Filename
4161755
Link To Document