• DocumentCode
    2441964
  • Title

    Feature selection increases cross-validation imprecision

  • Author

    Xiao, Yufei ; Hua, Jianping ; Dougherty, Edward R.

  • Author_Institution
    Dept. of Electr. Eng., Texas A&M Univ., College Station, TX
  • fYear
    2006
  • fDate
    28-30 May 2006
  • Firstpage
    17
  • Lastpage
    18
  • Abstract
    Even without feature selection, cross-validation error estimation is problematic for small samples owing to the high variance of the deviation distribution describing the difference between the estimated and true errors. This paper investigates the increased loss of cross-validation precision owing to feature selection by comparing deviation distributions and introducing two variation-based measures to quantify the further degradation in performance.
  • Keywords
    biology computing; estimation theory; feature extraction; genetics; sampling methods; statistical distributions; cross-validation error estimation; cross-validation imprecision; feature selection; genomics; high variance deviation distribution; Bioinformatics; Degradation; Distributed computing; Error analysis; Gaussian distribution; Genomics; Linear discriminant analysis; Loss measurement; Performance loss; Signal processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Genomic Signal Processing and Statistics, 2006. GENSIPS '06. IEEE International Workshop on
  • Conference_Location
    College Station, TX
  • Print_ISBN
    1-4244-0384-7
  • Electronic_ISBN
    1-4244-0385-5
  • Type

    conf

  • DOI
    10.1109/GENSIPS.2006.353134
  • Filename
    4161755