DocumentCode
583233
Title
Predicting viral infection by selecting informative biomarkers from temporal high-dimensional gene expression data
Author
Lou, Qiang ; Obradovic, Zoran
Author_Institution
Center for Data Analytics & Biomedicai Inf., Temple Univ., Philadelphia, PA, USA
fYear
2012
fDate
4-7 Oct. 2012
Firstpage
1
Lastpage
4
Abstract
In order to more accurately predict an individual´s health status, in clinical applications it is often important to perform analysis of high-dimensional gene expression data that varies with time. A major challenge in predicting from such temporal microarray data is that the number of biomarkers used as features is typically much larger than the number of labeled subjects. One way to address this challenge is to perform feature selection as a preprocessing step and then apply a classification method on selected features. However, traditional feature selection methods cannot handle multivariate temporal data without applying techniques that flatten temporal data into a single matrix in advance. In this study, a feature selection filter that can directly select informative features from temporal gene expression data is proposed. In our approach we measure the distance between multivariate temporal data from two subjects. Based on this distance, we define the objective function of temporal margin based feature selection to maximize each subject´s temporal margin in its own relevant subspace. The experimental results on two real flu data sets provide evidence that our method outperforms the alternatives, which flatten the temporal data in advance.
Keywords
cellular biophysics; feature extraction; filtering theory; genetics; lab-on-a-chip; medical information systems; microorganisms; pattern classification; classification method; clinical applications; feature selection filter; health status; informative biomarkers; multivariate temporal data; temporal high-dimensional gene expression data; temporal microarray data; viral infection; Biomarkers; Gene expression; Optimization; Predictive models; Time measurement; Time series analysis; Vectors; feature setection; high dimensional; margin; multivariate functional data; temporal data;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine (BIBM), 2012 IEEE International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
978-1-4673-2559-2
Electronic_ISBN
978-1-4673-2558-5
Type
conf
DOI
10.1109/BIBM.2012.6392631
Filename
6392631
Link To Document