Title :
A Comprehensive Statistical Model for Cell Signaling
Author :
Yörük, Erdem ; Ochs, Michael F. ; Geman, Donald ; Younes, Laurent
Author_Institution :
Dept. of Appl. Math. & Stat., Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
Protein signaling networks play a central role in transcriptional regulation and the etiology of many diseases. Statistical methods, particularly Bayesian networks, have been widely used to model cell signaling, mostly for model organisms and with focus on uncovering connectivity rather than inferring aberrations. Extensions to mammalian systems have not yielded compelling results, due likely to greatly increased complexity and limited proteomic measurements in vivo. In this study, we propose a comprehensive statistical model that is anchored to a predefined core topology, has a limited complexity due to parameter sharing and uses micorarray data of mRNA transcripts as the only observable components of signaling. Specifically, we account for cell heterogeneity and a multilevel process, representing signaling as a Bayesian network at the cell level, modeling measurements as ensemble averages at the tissue level, and incorporating patient-to-patient differences at the population level. Motivated by the goal of identifying individual protein abnormalities as potential therapeutical targets, we applied our method to the RAS-RAF network using a breast cancer study with 118 patients. We demonstrated rigorous statistical inference, established reproducibility through simulations and the ability to recover receptor status from available microarray data.
Keywords :
belief networks; biochemistry; biological organs; cancer; cellular biophysics; gynaecology; inference mechanisms; medical diagnostic computing; molecular biophysics; physiological models; proteins; proteomics; statistical analysis; tumours; Bayesian networks; RAS-RAF network; breast cancer; cell heterogeneity; cell signaling; comprehensive statistical model; disease; etiology; individual protein abnormalities; limited complexity; limited proteomic measurements; mRNA transcripts; mammalian systems; micorarray data; multilevel process; parameter sharing; patient-to-patient differences; potential therapeutical targets; predefined core topology; protein signaling networks; receptor status; statistical inference; tissue level; transcriptional regulation; Bayesian methods; Biological system modeling; Data models; Gene expression; Mathematical model; Protein engineering; Proteins; Bayesian networks; Cell signaling networks; Gibbs sampling; Mann-Whitney-Wilcoxon test.; microarray; signaling protein; statistical learning; stochastic approximation expectation maximization; Algorithms; Artificial Intelligence; Bayes Theorem; Cell Communication; Computational Biology; Computer Simulation; Gene Expression Profiling; Humans; Hybridization, Genetic; Models, Biological; Oligonucleotide Array Sequence Analysis; RNA, Messenger; Signal Transduction;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2010.87