Title :
Modeling Protein Interacting Groups by Quasi-Bicliques: Complexity, Algorithm, and Application
Author :
Liu, Xiaowen ; Li, Jinyan ; Wang, Lusheng
Author_Institution :
Dept. of Comput. Sci., City Univ. of Hong Kong, Hong Kong, China
Abstract :
Protein-protein interactions (PPIs) are one of the most important mechanisms in cellular processes. To model protein interaction sites, recent studies have suggested to find interacting protein group pairs from large PPI networks at the first step and then to search conserved motifs within the protein groups to form interacting motif pairs. To consider the noise effect and the incompleteness of biological data, we propose to use quasi-bicliquesior finding interacting protein group pairs. We investigate two new problems that arise from finding interacting protein group pairs: the maximum vertex quasi-biclique problem and the maximum balanced quasi-biclique problem. We prove that both problems are NP-hard. This is a surprising result as the widely known maximum vertex biclique problem is polynomial time solvable [1]. We then propose a heuristic algorithm that uses the greedy method to find the quasi-bicliques from PPI networks. Our experiment results on real data show that this algorithm has a better performance than a benchmark algorithm for identifying highly matched BLOCKS and PRINTS motifs. We also report results of two case studies on interacting motif pairs that map well with two interacting domain pairs in iPfam. Availability: The software and supplementary information are available at http://www.cs.cityu.edu.hk/~lwang/software/ppi/index.html.
Keywords :
bioinformatics; cellular biophysics; graph theory; molecular biophysics; polynomials; proteins; BLOCKS; NP-hard; PPI networks; PRINTS motifs; algorithm; cellular processes; complexity; conserved motifs; greedy method; heuristic algorithm; interacting motif pairs; interacting protein group pairs; maximum balanced quasi-biclique problem; maximum vertex quasi-biclique problem; noise effect; polynomial time solvable; protein interacting groups; protein interaction sites; protein-protein interactions; quasibicliques; Algorithms; NP-hardness; PPI networks; Protein interacting groups; Protein interaction sites; Protein-protein interactions; Quasi-bicliques; interaction sites; quasi-bicliques.; Algorithms; Amino Acid Motifs; Amino Acid Sequence; Computational Biology; Models, Chemical; Molecular Sequence Data; Protein Interaction Mapping; Protein Structure, Tertiary; Proteins;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2008.61