• DocumentCode
    2690275
  • Title

    A Log-Linear Graphical Model for inferring genetic networks from high-throughput sequencing data

  • Author

    Allen, Genevera I. ; Zhandong Liu

  • Author_Institution
    Dept. of Stat., Rice Univ., Houston, TX, USA
  • fYear
    2012
  • fDate
    4-7 Oct. 2012
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Gaussian graphical models are often used to infer gene networks based on microarray expression data. Many scientists, however, have begun using high-throughput sequencing technologies to measure gene expression. As the resulting high-dimensional count data consists of counts of sequencing reads for each gene, Gaussian graphical models are not optimal for modeling gene networks based on this discrete data. We develop a novel method for estimating high-dimensional Poisson graphical models, the Log-Linear Graphical Model, allowing us to infer networks based on high-throughput sequencing data. Our model assumes a pair-wise Markov property: conditional on all other variables, each variable is Poisson. We estimate our model locally via neighborhood selection by fitting 1-norm penalized log-linear models. Additionally, we develop a fast parallel algorithm permitting us to fit our graphical model to high-dimensional genomic data sets. We illustrate the effectiveness of our methods for recovering network structure from count data through simulations and a case study on breast cancer microRNA networks.
  • Keywords
    Gaussian processes; Markov processes; RNA; bioinformatics; biological organs; cancer; genetics; genomics; Gaussian graphical models; breast cancer microRNA networks; gene expression; high-dimensional Poisson graphical models; high-dimensional count data; high-dimensional genomic data sets; high-throughput sequencing data sets; high-throughput sequencing technology; inferring genetic networks; log-linear graphical model; microarray expression data; network structure; norm penalized log-linear models; pair-wise Markov property; variable is Poisson; Bandwidth; Bioinformatics; Breast cancer; Genomics; Graphical models; Markov random fields; Stability analysis; Markov networks; graphical models; microRNAs; next generation sequencing data; regulatory networks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine (BIBM), 2012 IEEE International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    978-1-4673-2559-2
  • Electronic_ISBN
    978-1-4673-2558-5
  • Type

    conf

  • DOI
    10.1109/BIBM.2012.6392619
  • Filename
    6392619