• DocumentCode
    2531084
  • Title

    Multi-stage Framework to Infer Protein Functional Modules from Mass Spectrometry Pull-Down Data with Assessment of Biological Relevance

  • Author

    Park, Byung-Hoon ; Zhang, Bing ; Karpinets, Tatiana ; Samatova, Nagiza F.

  • Author_Institution
    Comput. Biol. Inst., Oak Ridge
  • fYear
    2007
  • fDate
    2-4 Nov. 2007
  • Firstpage
    223
  • Lastpage
    229
  • Abstract
    Protein functional modules are fundamental units in protein interaction networks. High-throughput Mass Spectrometry (MS) technology has become valuable for discovery of protein functional modules. Yet, their computational inference from MS pull-down data and biological significance evaluation are still challenging. This paper introduces an integrated multi-step framework for (1) assessing protein-protein interaction affinities, (2) constructing a genome-wide protein association map, (3) finding putative protein functional modules, and (4) evaluating their biological relevance. The protein affinity score utilizes co- purification pattern of two proteins and adopts an information theoretic-approach to build the protein affinity map. Putative protein modules are then derived using a graph-theoretical approach. A two-stage statistical procedure assesses biological relevance of identified modules. On Saccharomyces cerevisiae´s pull-down data (Nature, vol. 415, pp. 141-7, 2002), the scoring scheme outperformed other methods by at least 10% in F1-measure, and statistical tests identified 489 protein modules enriched in all of three general GO categories with p-values less than 0.05.
  • Keywords
    association; biochemistry; genetics; graph theory; information theory; mass spectra; molecular biophysics; proteins; Saccharomyces cerevisiae; genome; graph theory; information theory; mass spectrometry; multistage framework; protein affinity; protein association map; protein functional modules; protein-protein interaction; Bioinformatics; Biological information theory; Biology computing; Clustering algorithms; Computational biology; Computer science; Iterative algorithms; Mass spectroscopy; Proteins; Proteomics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine, 2007. BIBM 2007. IEEE International Conference on
  • Conference_Location
    Fremont, CA
  • Print_ISBN
    978-0-7695-3031-4
  • Type

    conf

  • DOI
    10.1109/BIBM.2007.14
  • Filename
    4413059