• DocumentCode
    2413828
  • Title

    A graph-based elastic net for variable selection and module identification for genomic data analysis

  • Author

    Xia, Zheng ; Zhou, Xiaobo ; Chen, Wei ; Chang, Chunqi

  • Author_Institution
    Dept. of Radiol., Methodist Hosp. Res. Inst., Houston, TX, USA
  • fYear
    2010
  • fDate
    18-21 Dec. 2010
  • Firstpage
    357
  • Lastpage
    362
  • Abstract
    Recently a network-constraint regression model[1] is proposed to incorporate the prior biological knowledge to perform regression and variable selection. In their method, a l1-norm of the coefficients is defined to impose sparse, meanwhile a Laplacian operation on the biological graph is designed to encourage smoothness of the coefficients along the network. However the grouping effect of their Laplacian smoothness operation only exits when the two connected genes both have positive or negative effects on the response. To overcome this problem, we proposed to apply the Laplacian operation on the absolute values of the coefficients to take account of the positive and negative effects. Here, we call the presented method as graph-based elastic net (GENet) because the proposed method has similar grouping effect with elastic net(ENet)[2] except the smoothness of two coefficients are specified by the network in GENet. Further, an efficient algorithm which has same spirit with LARS [3] is developed to solve our optimization problem. Simulation studies showed that the proposed method has better performance than network-constrained regularization without absolute values. Application to Alzheimer´s disease(AD) microarray gene-expression dataset identified several subnetworks on Kyoto Encyclopedia of Genes and Genomes(KEGG) transcriptional pathways that are related to progression of AD. Many of those findings are confirmed by published literatures.
  • Keywords
    cellular biophysics; diseases; genomics; graph theory; medical computing; molecular biophysics; optimisation; regression analysis; sparse matrices; Alzheimer disease; Kyoto Encyclopedia of Genes and Genomes transcriptional pathways; Laplacian operation; Laplacian smoothness; genomic data analysis; graph-based elastic net; l1-norm; microarray gene expression; module identification; network-constrained regularization; network-constraint regression model; optimization; sparse; variable selection; Bioinformatics; Equations; Genomics; Laplace equations; Mathematical model; Optimization; Elastic Net; Laplacian graph; pathway;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine (BIBM), 2010 IEEE International Conference on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4244-8306-8
  • Electronic_ISBN
    978-1-4244-8307-5
  • Type

    conf

  • DOI
    10.1109/BIBM.2010.5706591
  • Filename
    5706591