• DocumentCode
    583233
  • Title

    Predicting viral infection by selecting informative biomarkers from temporal high-dimensional gene expression data

  • Author

    Lou, Qiang ; Obradovic, Zoran

  • Author_Institution
    Center for Data Analytics & Biomedicai Inf., Temple Univ., Philadelphia, PA, USA
  • fYear
    2012
  • fDate
    4-7 Oct. 2012
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In order to more accurately predict an individual´s health status, in clinical applications it is often important to perform analysis of high-dimensional gene expression data that varies with time. A major challenge in predicting from such temporal microarray data is that the number of biomarkers used as features is typically much larger than the number of labeled subjects. One way to address this challenge is to perform feature selection as a preprocessing step and then apply a classification method on selected features. However, traditional feature selection methods cannot handle multivariate temporal data without applying techniques that flatten temporal data into a single matrix in advance. In this study, a feature selection filter that can directly select informative features from temporal gene expression data is proposed. In our approach we measure the distance between multivariate temporal data from two subjects. Based on this distance, we define the objective function of temporal margin based feature selection to maximize each subject´s temporal margin in its own relevant subspace. The experimental results on two real flu data sets provide evidence that our method outperforms the alternatives, which flatten the temporal data in advance.
  • Keywords
    cellular biophysics; feature extraction; filtering theory; genetics; lab-on-a-chip; medical information systems; microorganisms; pattern classification; classification method; clinical applications; feature selection filter; health status; informative biomarkers; multivariate temporal data; temporal high-dimensional gene expression data; temporal microarray data; viral infection; Biomarkers; Gene expression; Optimization; Predictive models; Time measurement; Time series analysis; Vectors; feature setection; high dimensional; margin; multivariate functional data; temporal data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine (BIBM), 2012 IEEE International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    978-1-4673-2559-2
  • Electronic_ISBN
    978-1-4673-2558-5
  • Type

    conf

  • DOI
    10.1109/BIBM.2012.6392631
  • Filename
    6392631