Title :
Parallel Computing Algorithms for Reverse-Engineering and Analysis of Genome-Wide Gene Regulatory Networks from Gene Expression Profiles
Author :
Belcastro, Vincenzo ; di Bernardo, D. ; Gregoretti, Francesco ; Oliva, Gennaro
Author_Institution :
Telethon Inst. of Genetics & Med. TIGEM, Naples, Italy
fDate :
Sept. 30 2010-Oct. 1 2010
Abstract :
A Gene Regulatory Network links pairs of genes through an edge if they physically or functionally interact. "Reverse engineering" a gene regulatory network means inferring the edges between genes from the available experimental data. Transcriptional responses (i.e. gene expression profiles obtained through microarray experiments) are often used to reverse-engineer a network of genes. Reverse-engineering consists of analyzing transcriptional responses to a set of treatments and adding an edge between two genes if their expression shows coordinated behavior on a subset of the treatments, according to some underlying model of gene regulation. Mammalian cells contain tens of thousands of genes, and it is necessary to analyze hundreds of transcriptional responses in order to have acceptable statistical evidence of interactions between genes. Several ready-to-use software packages can infer gene networks, but few can be used to infer large networks from thousands of transcriptional responses, as the dimension of the problem leads to high computational costs and memory requirements. We propose to exploit parallel computing techniques to overcome this problem. In this work, we designed and developed a parallel computing algorithm to reverse-engineer large-scale gene regulatory networks from tens of thousands of gene expression profiles. The algorithm is based on computing the pairwise Mutual Information for each gene pair. We successfully tested it by inferring the Mus musculus (mouse) gene regulatory network in liver from 312 expression profiles collected from a public Internet repository. Each profile measures the expression of 45,101 genes (more specifically, transcripts). We analyzed all possible gene pairs, about 10^9 in total, identifying about 6 · 10^7 edges. We used a hierarchical clustering algorithm to discover communities within the gene network, and found a modular structure that highlights genes involved in the same biological functions.
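As a rough illustration of the pairwise approach described in the abstract, the following minimal Python sketch scores every gene pair with a plug-in Mutual Information estimate and keeps pairs above a cutoff. The equal-width binning, the number of bins, the threshold, the function names, and the toy data are all assumptions made for illustration; the abstract does not specify the estimator, the significance test, or the parallel partitioning the authors used. Because each pair is scored independently, the loop over pairs is the natural unit of work to distribute across processes.

```python
import math
from collections import Counter
from itertools import combinations


def discretize(profile, n_bins=4):
    """Map each expression value to an equal-width bin index in [0, n_bins)."""
    lo, hi = min(profile), max(profile)
    if hi == lo:
        return [0] * len(profile)
    width = (hi - lo) / n_bins
    return [min(int((v - lo) / width), n_bins - 1) for v in profile]


def mutual_information(x_bins, y_bins):
    """Plug-in MI estimate (in bits) between two discretized profiles."""
    n = len(x_bins)
    px, py = Counter(x_bins), Counter(y_bins)
    pxy = Counter(zip(x_bins, y_bins))
    # Sum p(a,b) * log2( p(a,b) / (p(a) p(b)) ) over observed bin pairs.
    return sum((c / n) * math.log2(c * n / (px[a] * py[b]))
               for (a, b), c in pxy.items())


def infer_edges(expression, threshold, n_bins=4):
    """Return (gene1, gene2, mi) for every pair whose MI reaches the threshold.

    `threshold` is a hypothetical fixed cutoff chosen for this sketch.
    """
    binned = {g: discretize(p, n_bins) for g, p in expression.items()}
    edges = []
    for g1, g2 in combinations(sorted(binned), 2):
        mi = mutual_information(binned[g1], binned[g2])
        if mi >= threshold:
            edges.append((g1, g2, mi))
    return edges


# Toy data (made up): geneB tracks geneA, geneC alternates independently.
expression = {
    "geneA": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0],
    "geneB": [2.1, 4.0, 6.2, 8.1, 9.9, 12.0, 14.1, 16.0],
    "geneC": [5.0, 1.0, 5.0, 1.0, 5.0, 1.0, 5.0, 1.0],
}
edges = infer_edges(expression, threshold=1.0)

# Number of unordered pairs among 45,101 transcripts, i.e. the "about 10^9".
n_pairs = math.comb(45101, 2)
```

On the toy data only the geneA/geneB pair passes the cutoff, and `n_pairs` evaluates to 1,017,027,550, consistent with the order of magnitude quoted in the abstract.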
Keywords :
biology computing; genetics; parallel algorithms; reverse engineering; software packages; biological function; gene expression profile; gene regulation; genome-wide gene regulatory network; hierarchical clustering algorithm; mammalian cell; modular structure; mus musculus gene regulatory network; parallel computing algorithm; public Internet repository; reverse engineering; software package; Clustering Algorithm; Gene Regulatory Network; Parallel Computing; Reverse Engineering;
Conference_Titel :
Parallel and Distributed Methods in Verification, 2010 Ninth International Workshop on, and High Performance Computational Systems Biology, Second International Workshop on
Conference_Location :
Enschede
Print_ISBN :
978-0-7695-4265-2
DOI :
10.1109/PDMC-HiBi.2010.20