DocumentCode
2923403
Title
Two-stage variable selection for molecular prediction of disease
Author
Firouzi, Hamed ; Rajaratnam, Bala ; Hero, Alfred O.
Author_Institution
Dept. of EECS, Univ. of Michigan, Ann Arbor, MI, USA
fYear
2013
fDate
15-18 Dec. 2013
Firstpage
169
Lastpage
172
Abstract
A two-stage predictor strategy is introduced in the context of high dimensional data (large p, small n). Here the focus application is a medical one: prediction of symptomatic infection given molecular expression levels in blood. The first stage of the two-stage predictor uses the previously introduced method of Predictive Correlation Screening (PCS) to select a subset of genes that are important in the prediction of symptom scores. Selected genes are used in the second stage to learn a predictor for the prediction of symptom scores. Under sampling budget constraints we derive the optimal sample allocation rules to the first and second stages of the two-stage predictor. Superiority of the proposed predictor relative to the well known method of LASSO is shown via experiment.
Keywords
blood; diseases; genetics; molecular biophysics; statistical analysis; LASSO method; blood; disease; gene subset; high-dimensional data; molecular expression levels; molecular prediction; sampling budget constraints; symptomatic infection; two-stage variable selection; Conferences; Correlation; Covariance matrices; Gene expression; Heating; Resource management; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP), 2013 IEEE 5th International Workshop on
Conference_Location
St. Martin
Print_ISBN
978-1-4673-3144-9
Type
conf
DOI
10.1109/CAMSAP.2013.6714034
Filename
6714034
Link To Document