DocumentCode :
1991696
Title :
A criterion for choosing between full-sample and hold-out classifier design
Author :
Brun, Marcel ; Xu, Qian ; Dougherty, Edward R.
Author_Institution :
Univ. Nac. de Mar del Plata, Mar del Plata
fYear :
2008
fDate :
8-10 June 2008
Firstpage :
1
Lastpage :
2
Abstract :
Is it better to design a classifier and estimate its error on the full sample or to design a classifier on a training subset and estimate its error on the hold-out test subset? Full-sample design provides the better classifier; nevertheless, one might choose hold-out with the hope of better error estimation. A conservative criterion to decide the best course is to aim at a classifier whose error is less than a given bound. Then the choice between full-sample and hold-out design depends on which possesses the smaller expected bound. Using this criterion, we examine the choice between hold-out and several full-sample error estimators using covariance models. The relation between the two designs is revealed via a decomposition of the expected bound into the sum of the expected true error and the expected conditional standard deviation of the true error.
Keywords :
covariance analysis; error analysis; genetics; medical computing; pattern classification; classifier error estimation; conservative criterion; covariance models; error bound decomposition; full sample classifier design; hold out classifier design; true error conditional standard deviation; Bioinformatics; Computational biology; Computer errors; Error analysis; Genomics; Process design; Sampling methods; Strontium; Testing; Tin;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Genomic Signal Processing and Statistics, 2008. GENSiPS 2008. IEEE International Workshop on
Conference_Location :
Phoenix, AZ
Print_ISBN :
978-1-4244-2371-2
Electronic_ISBN :
978-1-4244-2372-9
Type :
conf
DOI :
10.1109/GENSIPS.2008.4555662
Filename :
4555662
Link To Document :
بازگشت