• DocumentCode
    3519162
  • Title

    Identifying Interface Elements Implied in Protein-Protein Interactions Using Statistical Tests and Frequent Item Sets

  • Author

    Martin, Christine ; Cornuejols, A.

  • Author_Institution
    LIMSI, Univ. d´´Orsay Paris Sud, Orsay
  • fYear
    2008
  • fDate
    3-5 Nov. 2008
  • Firstpage
    78
  • Lastpage
    83
  • Abstract
    Understanding what are the characteristics of protein-protein interfaces is at the core of numerous applications.This paper introduces a method in which the proteins are described with surfacic geometrical elements. Starting from a database of known interfaces, the method produces the elements and combinations thereof that are characteristic of the interfaces. This is done thanks to a frequent item set technique and the use of statistical tests to ensure a marked difference with a null hypothesis. This approach allows one to easily interpret the results, as compared to techniques that operate as ldquoblack-boxesrdquo. Furthermore, it is naturally adapted to discover disjunctive concepts, i.e. different underlying processes. The results obtained on a set of 459 protein-protein interfaces from the PDB database confirm that the findings are consistent with current knowledge about protein-protein interfaces.
  • Keywords
    biology computing; proteins; statistical analysis; PDB database; frequent item sets; interface elements; protein-protein interaction; statistical tests; surfacic geometrical elements; Bioinformatics; Data mining; Dictionaries; Learning systems; Machine learning; Predictive models; Proteins; Spatial databases; Testing; Transaction databases; data mining; frequent item sets; protein-protein interactions;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine, 2008. BIBM '08. IEEE International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    978-0-7695-3452-7
  • Type

    conf

  • DOI
    10.1109/BIBM.2008.68
  • Filename
    4684876