• DocumentCode
    1991696
  • Title

    A criterion for choosing between full-sample and hold-out classifier design

  • Author

    Brun, Marcel ; Xu, Qian ; Dougherty, Edward R.

  • Author_Institution
    Univ. Nac. de Mar del Plata, Mar del Plata
  • fYear
    2008
  • fDate
    8-10 June 2008
  • Firstpage
    1
  • Lastpage
    2
  • Abstract
    Is it better to design a classifier and estimate its error on the full sample or to design a classifier on a training subset and estimate its error on the hold-out test subset? Full-sample design provides the better classifier; nevertheless, one might choose hold-out with the hope of better error estimation. A conservative criterion to decide the best course is to aim at a classifier whose error is less than a given bound. Then the choice between full-sample and hold-out design depends on which possesses the smaller expected bound. Using this criterion, we examine the choice between hold-out and several full-sample error estimators using covariance models. The relation between the two designs is revealed via a decomposition of the expected bound into the sum of the expected true error and the expected conditional standard deviation of the true error.
  • Keywords
    covariance analysis; error analysis; genetics; medical computing; pattern classification; classifier error estimation; conservative criterion; covariance models; error bound decomposition; full sample classifier design; hold out classifier design; true error conditional standard deviation; Bioinformatics; Computational biology; Computer errors; Error analysis; Genomics; Process design; Sampling methods; Strontium; Testing; Tin;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Genomic Signal Processing and Statistics, 2008. GENSiPS 2008. IEEE International Workshop on
  • Conference_Location
    Phoenix, AZ
  • Print_ISBN
    978-1-4244-2371-2
  • Electronic_ISBN
    978-1-4244-2372-9
  • Type

    conf

  • DOI
    10.1109/GENSIPS.2008.4555662
  • Filename
    4555662