• DocumentCode
    80827
  • Title

    A Local Poisson Graphical Model for Inferring Networks From Sequencing Data

  • Author

    Allen, Genevera I. ; Zhandong Liu

  • Author_Institution
    Dept. of Stat. & Electr. & Comput. Eng., Rice Univ., Houston, TX, USA
  • Volume
    12
  • Issue
    3
  • fYear
    2013
  • fDate
    Sept. 2013
  • Firstpage
    189
  • Lastpage
    198
  • Abstract
    Gaussian graphical models, a class of undirected graphs or Markov Networks, are often used to infer gene networks based on microarray expression data. Many scientists, however, have begun using high-throughput sequencing technologies such as RNA-sequencing or next generation sequencing to measure gene expression. As the resulting data consists of counts of sequencing reads for each gene, Gaussian graphical models are not optimal for this discrete data. In this paper, we propose a novel method for inferring gene networks from sequencing data: the Local Poisson Graphical Model. Our model assumes a Local Markov property where each variable conditional on all other variables is Poisson distributed. We develop a neighborhood selection algorithm to fit our model locally by performing a series of l1 penalized Poisson, or log-linear, regressions. This yields a fast parallel algorithm for estimating networks from next generation sequencing data. In simulations, we illustrate the effectiveness of our methods for recovering network structure from count data. A case study on breast cancer microRNAs (miRNAs), a novel application of graphical models, finds known regulators of breast cancer genes and discovers novel miRNA clusters and hubs that are targets for future research.
  • Keywords
    Markov processes; Poisson distribution; RNA; bioinformatics; biological organs; cancer; data structures; genetics; genomics; molecular biophysics; molecular configurations; parallel algorithms; physiological models; regression analysis; Gaussian graphical model; Markov network; Poisson distribution; RNA next generation sequencing; breast cancer gene regulator; breast cancer microRNA; data network structure; data recovery; data sequencing; gene expression measurement; gene network inferring; high-throughput sequencing technology; l1 penalized Poisson series; l1 penalized Poisson series; local Markov property; local poisson graphical model; log-linear analysis; microarray data expression; neighborhood selection algorithm; parallel algorithm; regression analysis; undirected graph; Bioinformatics; Genomics; Graphical models; Markov random fields; Sequential analysis; Stability analysis; Gene regulatory networks; Markov networks; microRNAs; next generation sequencing data; undirected graphical models; Breast Neoplasms; Computational Biology; Computer Simulation; Female; Gene Regulatory Networks; Humans; Markov Chains; MicroRNAs; Models, Genetic; Poisson Distribution; Sequence Analysis, DNA;
  • fLanguage
    English
  • Journal_Title
    NanoBioscience, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1536-1241
  • Type

    jour

  • DOI
    10.1109/TNB.2013.2263838
  • Filename
    6578145