• DocumentCode
    2582911
  • Title
    Effective pre-processing strategies for functional clustering of a protein-protein interactions network
  • Author
    Ucar, Duygu ; Parthasarathy, Srinivasan ; Asur, Sitaram ; Wang, Chao
  • Author_Institution
    Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
  • fYear
    2005
  • fDate
    19-21 Oct. 2005
  • Firstpage
    129
  • Lastpage
    136
  • Abstract
    In this article we present novel preprocessing techniques, based on topological measures of the network, to identify clusters of proteins from protein-protein interaction (PPI) networks wherein each cluster corresponds to a group of functionally similar proteins. The two main problems with analyzing protein-protein interaction networks are their scale-free property and the large number of false positive interactions that they contain. Our preprocessing techniques use a key transformation and separate weighting functions to effectively eliminate suspect edges, potential false positives, from the graph. A useful side-effect of this transformation is that the resulting graph is no longer scale-free. We then examine the application of two well-known clustering techniques, namely hierarchical and multilevel graph partitioning, on the reduced network. We define suitable statistical metrics to evaluate our clusters meaningfully. From our study, we discover that the application of clustering on the pre-processed network results in significantly improved, biologically relevant and balanced clusters when compared with clusters derived from the original network. We strongly believe that our strategies would prove invaluable to future studies on prediction of protein functionality from PPI networks.
  • Keywords
    biology computing; graphs; molecular biophysics; proteins; statistical analysis; false positive interactions; functional clustering; hierarchical graph partitioning; multilevel graph partitioning; preprocessing strategies; protein functionality; protein-protein interactions network; scale-free property; Bioinformatics; Biological processes; Chaos; Clustering algorithms; Computer science; Databases; Information resources; Ontologies; Partitioning algorithms; Protein engineering
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Bioinformatics and Bioengineering, 2005. BIBE 2005. Fifth IEEE Symposium on
  • Print_ISBN
    0-7695-2476-1
  • Type
    conf
  • DOI
    10.1109/BIBE.2005.25
  • Filename
    1544458