• DocumentCode
    2923403
  • Title

    Two-stage variable selection for molecular prediction of disease

  • Author

    Firouzi, Hamed ; Rajaratnam, Bala ; Hero, Alfred O.

  • Author_Institution
    Dept. of EECS, Univ. of Michigan, Ann Arbor, MI, USA
  • fYear
    2013
  • fDate
    15-18 Dec. 2013
  • Firstpage
    169
  • Lastpage
    172
  • Abstract
    A two-stage predictor strategy is introduced in the context of high dimensional data (large p, small n). Here the focus application is a medical one: prediction of symptomatic infection given molecular expression levels in blood. The first stage of the two-stage predictor uses the previously introduced method of Predictive Correlation Screening (PCS) to select a subset of genes that are important in the prediction of symptom scores. Selected genes are used in the second stage to learn a predictor for the prediction of symptom scores. Under sampling budget constraints we derive the optimal sample allocation rules to the first and second stages of the two-stage predictor. Superiority of the proposed predictor relative to the well known method of LASSO is shown via experiment.
  • Keywords
    blood; diseases; genetics; molecular biophysics; statistical analysis; LASSO method; blood; disease; gene subset; high-dimensional data; molecular expression levels; molecular prediction; sampling budget constraints; symptomatic infection; two-stage variable selection; Conferences; Correlation; Covariance matrices; Gene expression; Heating; Resource management; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP), 2013 IEEE 5th International Workshop on
  • Conference_Location
    St. Martin
  • Print_ISBN
    978-1-4673-3144-9
  • Type

    conf

  • DOI
    10.1109/CAMSAP.2013.6714034
  • Filename
    6714034