DocumentCode
589310
Title
Sampling-Based Subnetwork Identification from Microarray Data and Protein-Protein Interaction Network
Author
Xiao Wang ; Jinghua Gu ; Jianhua Xuan ; Li Chen ; Shajahan, Ayesha N. ; Clarke, Roger
Author_Institution
Bradley Dept. of Electr. & Comput. Eng., Virginia Tech, Arlington, VA, USA
Volume
2
fYear
2012
fDate
12-15 Dec. 2012
Firstpage
158
Lastpage
163
Abstract
Identification of condition-specific protein interaction subnetworks has emerged as an attractive research field to reveal molecular mechanisms of diseases and provide reliable network biomarkers for disease diagnosis. Several methods have been proposed, which integrate gene expression and protein-protein interaction (PPI) data to identify subnetworks. However, existing methods treat differential expression of genes and network topology independently, which is an oversimplified assumption to model real biological systems. In this paper, we propose a sampling-based subnetwork identification approach to take into account the dependency between gene expression and network topology. Specifically, we apply Markov random field (MRF) theory to model the dependency of genes in PPI network using a Bayesian framework, followed by a Markov Chain Monte Carlo (MCMC) approach to identify significant subnetworks. The MCMC approach estimates the posterior distribution of genes´ significant scores and network structure iteratively. Experimental results on both synthetic data and real breast cancer data demonstrated the effectiveness of the proposed method in identifying subnetworks, especially several functionally important, aberrant subnetworks associated with pathways involved in the development and recurrence of breast cancer.
Keywords
Bayes methods; Markov processes; Monte Carlo methods; biological organs; cancer; genetics; molecular biophysics; patient diagnosis; proteins; random processes; sampling methods; Bayesian framework; MCMC approach; MRF theory; Markov Chain Monte Carlo approach; Markov random field theory; PPI network; aberrant subnetworks; condition-specific protein interaction subnetworks identification; disease diagnosis; diseases molecular mechanisms; gene expression; genes differential expression; microarray data; network biomarkers; network structure; network topology; posterior distribution estimation; protein-protein interaction network; real biological system modeling; real breast cancer data; sampling-based subnetwork identification; synthetic data; Biomarkers; Breast cancer; Data models; Gene expression; Proteins; Vectors; Breast cancer; Gene expression; Markov Chain Monte Carlo (MCMC); Markov random field (MRF); Protein-protein interaction (PPI); Subnetwork identification;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Applications (ICMLA), 2012 11th International Conference on
Conference_Location
Boca Raton, FL
Print_ISBN
978-1-4673-4651-1
Type
conf
DOI
10.1109/ICMLA.2012.221
Filename
6406744
Link To Document