• Title of article

    Assessing the statistical validity of proteomics based biomarkers Original Research Article

  • Author/Authors

    Suzanne Smit، نويسنده , , Mariëlle J. van Breemen، نويسنده , , Huub C.J. Hoefsloot، نويسنده , , Johan A. Westerhuis and Age K. Smilde، نويسنده , , Johannes M.F.G. Aerts، نويسنده , , Chris G. de Koster، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2007
  • Pages
    8
  • From page
    210
  • To page
    217
  • Abstract
    A strategy is presented for the statistical validation of discrimination models in proteomics studies. Several existing tools are combined to form a solid statistical basis for biomarker discovery that should precede a biochemical validation of any biomarker. These tools consist of permutation tests, single and double cross-validation. The cross-validation steps can simply be combined with a new variable selection method, called rank products. The strategy is especially suited for the low-samples-to-variables-ratio (undersampling) case, as is often encountered in proteomics and metabolomics studies. As a classification method, principal component discriminant analysis is used; however, the methodology can be used with any classifier. A dataset containing serum samples from Gaucher patients and healthy controls serves as a test case. Double cross-validation shows that the sensitivity of the model is 89% and the specificity 90%. Potential putative biomarkers are identified using the novel variable selection method. Results from permutation tests support the choice of double cross-validation as the tool for determining error rates when the modelling procedure involves a tuneable parameter. This shows that even cross-validation does not guarantee unbiased results. The validation of discrimination models with a combination of permutation tests and double cross-validation helps to avoid erroneous results which may result from the undersampling.
  • Keywords
    Surface enh , Curse of dimensionality , Permutation test , Principal component discriminant analysis , Double cross-validation , Gaucher disease , Statistical validation , classification , Biomarker discovery , Rank products
  • Journal title
    Analytica Chimica Acta
  • Serial Year
    2007
  • Journal title
    Analytica Chimica Acta
  • Record number

    1030896