Title :
Integrating multi-source biological data for transcriptional regulatory module discovery
Author :
Ressom, Habtom W. ; Zhang, Yuji ; Xuan, Jianhua ; Wang, Yue ; Clarke, Robert
Author_Institution :
Georgetown Univ., Washington
Abstract :
The design principles of gene transcriptional regulation networks in cells have been puzzles due to their unknown dynamic and nonlinear mechanisms. Although high-throughput biotechnologies have generated unprecedented amounts of data, the integration of multi-source data to better understand the process of gene regulation has been a challenge in post genomics era. Gene expression data are limited in providing information about the underlying causal relationships among genes. Prior biological knowledge such as protein binding data and gene ontology annotation, albeit limited in quantity, reflects physical processes of gene regulation. In this paper, we introduce a computational framework for utilizing time course gene expression patterns, protein binding data, and gene ontology information to infer transcriptional regulatory modules. The proposed method mainly consists of three parts: (1) a fuzzy c-means clustering approach that exploits gene functional category information to define gene clusters; (2) a network motif detection tool that classifies the transcription factors into different kinds of regulatory modules based on protein binding data; and (3) a recurrent neural network model for each transcription factor that mimics the architecture of the predicted regulatory module. A hybrid of genetic algorithm and particle swarm optimization method is applied to search for gene cluster that may be regulated by the transcription factor and to determine the parameters of the recurrent neural network. The proposed method is tested on yeast cell cycle process. The inferred gene transcriptional regulatory networks are compared with previously reported results in the literature.
Keywords :
biochemistry; biology computing; cellular biophysics; fuzzy set theory; genetic algorithms; genetics; molecular biophysics; ontologies (artificial intelligence); particle swarm optimisation; proteins; recurrent neural nets; fuzzy c-means clustering; gene expression; gene ontology annotation; gene transcriptional regulation networks; genetic algorithm; genomics; multisource biological data; network motif detection; particle swarm optimization; protein binding; recurrent neural network; transcriptional regulatory module discovery; yeast cell cycle; Bioinformatics; Biology computing; Biotechnology; Fuzzy neural networks; Gene expression; Genomics; Ontologies; Predictive models; Proteins; Recurrent neural networks;
Conference_Titel :
Life Science Systems and Applications Workshop, 2007. LISA 2007. IEEE/NIH
Conference_Location :
Bethesda, MD
Print_ISBN :
978-1-4244-1813-8
Electronic_ISBN :
978-1-4244-1813-8
DOI :
10.1109/LSSA.2007.4400915