DocumentCode
80827
Title
A Local Poisson Graphical Model for Inferring Networks From Sequencing Data
Author
Allen, Genevera I. ; Zhandong Liu
Author_Institution
Dept. of Stat. & Electr. & Comput. Eng., Rice Univ., Houston, TX, USA
Volume
12
Issue
3
fYear
2013
fDate
Sept. 2013
Firstpage
189
Lastpage
198
Abstract
Gaussian graphical models, a class of undirected graphs or Markov Networks, are often used to infer gene networks based on microarray expression data. Many scientists, however, have begun using high-throughput sequencing technologies such as RNA-sequencing or next generation sequencing to measure gene expression. As the resulting data consists of counts of sequencing reads for each gene, Gaussian graphical models are not optimal for this discrete data. In this paper, we propose a novel method for inferring gene networks from sequencing data: the Local Poisson Graphical Model. Our model assumes a Local Markov property where each variable conditional on all other variables is Poisson distributed. We develop a neighborhood selection algorithm to fit our model locally by performing a series of l1 penalized Poisson, or log-linear, regressions. This yields a fast parallel algorithm for estimating networks from next generation sequencing data. In simulations, we illustrate the effectiveness of our methods for recovering network structure from count data. A case study on breast cancer microRNAs (miRNAs), a novel application of graphical models, finds known regulators of breast cancer genes and discovers novel miRNA clusters and hubs that are targets for future research.
Keywords
Markov processes; Poisson distribution; RNA; bioinformatics; biological organs; cancer; data structures; genetics; genomics; molecular biophysics; molecular configurations; parallel algorithms; physiological models; regression analysis; Gaussian graphical model; Markov network; Poisson distribution; RNA next generation sequencing; breast cancer gene regulator; breast cancer microRNA; data network structure; data recovery; data sequencing; gene expression measurement; gene network inferring; high-throughput sequencing technology; l1 penalized Poisson series; l1 penalized Poisson series; local Markov property; local poisson graphical model; log-linear analysis; microarray data expression; neighborhood selection algorithm; parallel algorithm; regression analysis; undirected graph; Bioinformatics; Genomics; Graphical models; Markov random fields; Sequential analysis; Stability analysis; Gene regulatory networks; Markov networks; microRNAs; next generation sequencing data; undirected graphical models; Breast Neoplasms; Computational Biology; Computer Simulation; Female; Gene Regulatory Networks; Humans; Markov Chains; MicroRNAs; Models, Genetic; Poisson Distribution; Sequence Analysis, DNA;
fLanguage
English
Journal_Title
NanoBioscience, IEEE Transactions on
Publisher
ieee
ISSN
1536-1241
Type
jour
DOI
10.1109/TNB.2013.2263838
Filename
6578145
Link To Document