• DocumentCode
    2831133
  • Title

    Similarity Analysis of Protein Binding Sites: A Generalization of the Maximum Common Subgraph Measure Based on Quasi-Clique Detection

  • Author

    Boukhris, Imen ; Elouedi, Zied ; Fober, Thomas ; Mernberger, Marco ; Hullermeier, Eyke

  • Author_Institution
    LARODEC Lab., Higher Inst. of Bus., Tunis, Tunisia
  • fYear
    2009
  • fDate
    Nov. 30 2009-Dec. 2 2009
  • Firstpage
    1245
  • Lastpage
    1250
  • Abstract
    Protein binding sites are often represented by means of graphs capturing their most important geometrical and physicochemical properties. Searching for structural similarities and identifying functional relationships between them can thus be reduced to matching their corresponding graph descriptors. In this paper, we propose a method for the structural analysis of protein binding sites that makes use of such matching techniques to assess the similarity between proteins independently of sequence or fold homology. More specifically, we propose a similarity measure that generalizes the commonly used maximum common subgraph measure in two ways. First, using algorithms for so-called quasi-clique detection, our measure is based on maximum `approximately´ common subgraphs, a relaxation of maximum common subgraphs which is tolerant toward edge mismatches. Second, instead of focusing on equivalence, our measure is a compromise between a generalized equivalence and an inclusion measure. An experimental study is presented to illustrate the effectiveness of the method and to show that both types of relaxation are useful in the context of protein structure analysis.
  • Keywords
    bioinformatics; graph theory; proteins; maximum common subgraph measure; physicochemical properties; protein binding sites; protein similarity analysis; protein structure analysis; quasiclique detection; structural analysis; Bioinformatics; Computer science; Data mining; Drugs; Intelligent systems; Laboratories; Mathematics; Protein engineering; Spatial databases; System analysis and design; cliques; graphs; protein binding sites; quasicliques;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Applications, 2009. ISDA '09. Ninth International Conference on
  • Conference_Location
    Pisa
  • Print_ISBN
    978-1-4244-4735-0
  • Electronic_ISBN
    978-0-7695-3872-3
  • Type

    conf

  • DOI
    10.1109/ISDA.2009.75
  • Filename
    5364132