DocumentCode
2582911
Title
Effective pre-processing strategies for functional clustering of a protein-protein interactions network
Author
Ucar, Duygu ; Parthasarathy, Srinivasan ; Asur, Sitaram ; Wang, Chao
Author_Institution
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
fYear
2005
fDate
19-21 Oct. 2005
Firstpage
129
Lastpage
136
Abstract
In this article we present novel preprocessing techniques, based on typological measures of the network, to identify clusters of proteins from protein-protein interaction (PPI) networks wherein each cluster corresponds to a group of functionally similar proteins. The two main problems with analyzing protein-protein interaction networks are their scale-free property and the large number of false positive interactions that they contain. Our preprocessing techniques use a key transformation and separate weighting functions to effectively eliminate suspect edges, potential false positives, from the graph. A useful side-effect of this transformation is that the resulting graph is no longer scale free. We then examine the application of two well-known clustering techniques, namely hierarchical and multilevel graph partitioning on the reduced network. We define suitable statistical metrics to evaluate our clusters meaningfully. From our study, we discover that the application of clustering on the pre-processed network results in significantly improved, biologically relevant and balanced clusters when compared with clusters derived from the original network. We strongly believe that our strategies would prove invaluable to future studies on prediction of protein functionality from PPI networks.
Keywords
biology computing; graphs; molecular biophysics; proteins; statistical analysis; false positive interactions; functional clustering; hierarchical graph partitioning; multilevel graph partitioning; preprocessing strategies; protein functionality; protein-protein interactions network; scale-free property; Bioinformatics; Biological processes; Chaos; Clustering algorithms; Computer science; Databases; Information resources; Ontologies; Partitioning algorithms; Protein engineering;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Bioengineering, 2005. BIBE 2005. Fifth IEEE Symposium on
Print_ISBN
0-7695-2476-1
Type
conf
DOI
10.1109/BIBE.2005.25
Filename
1544458
Link To Document