Title :
Dynamical Systems for Discovering Protein Complexes and Functional Modules from Biological Networks
Author :
Li, Wenyuan ; Liu, Ying ; Huang, Hung-Chung ; Peng, Yanxiong ; Lin, Yongjing ; Ng, Wee-Keong ; Ong, Kok-Leong
Author_Institution :
Dept. of Molecular & Cell Biol., Texas Univ., Richardson, TX
Abstract :
Recent advances in high throughput experiments and annotations via published literature have provided a wealth of interaction maps of several biomolecular networks, including metabolic, protein-protein, and protein-DNA interaction networks. The architecture of these molecular networks reveals important principles of cellular organization and molecular functions. Analyzing such networks, i.e., discovering dense regions in the network, is an important way to identify protein complexes and functional modules. This task has been formulated as the problem of finding heavy subgraphs, the heaviest k-subgraph problem (k-HSP), which itself is NP-hard. However, any method based on the k-HSP requires the parameter k and an exact solution of k-HSP may still end up as a "spurious" heavy subgraph, thus reducing its practicability in analyzing large scale biological networks. We proposed a new formulation, called the rank-HSP, and two dynamical systems to approximate its results. In addition, a novel metric, called the standard deviation and mean ratio (SMR), is proposed for use in "spurious" heavy subgraphs to automate the discovery by setting a fixed threshold. Empirical results on both the simulated graphs and biological networks have demonstrated the efficiency and effectiveness of our proposal
Keywords :
biochemistry; biology computing; computational complexity; graphs; molecular biophysics; optimisation; proteins; NP-hard; biological networks; biomolecular networks; cellular organization; dense regions; dynamical systems; functional modules; heaviest k-subgraph problem; heavy subgraphs; mean ratio; metabolic interaction networks; molecular functions; protein complexes; protein-DNA interaction networks; protein-protein interaction networks; rank-HSP; standard deviation; Bioinformatics; Biological system modeling; Biology computing; Cellular networks; Computer science; Fungi; Genomics; Large-scale systems; Protein engineering; Throughput; Graph algorithms; bioinformatics databases.; evolutionary computing; neural nets; Algorithms; Computer Simulation; Data Interpretation, Statistical; Gene Expression; Models, Biological; Protein Interaction Mapping; Proteome; Signal Transduction;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2007.070210