• DocumentCode
    3409365
  • Title

    Probability profile method - new approach to data analysis in tandem mass spectrometry

  • Author

    Gorin, Andrey ; Day, Robert M. ; Borziak, A. ; Strader, Michael B. ; Hurst, Gregory B. ; Fridman, Terna

  • Author_Institution
    Div. of Comput. Sci. & Mathematics, Oak Ridge Nat. Lab., TN, USA
  • fYear
    2004
  • fDate
    16-19 Aug. 2004
  • Firstpage
    499
  • Lastpage
    502
  • Abstract
    Tandem mass spectrometry (MS/MS) is one of the leading proteomics technologies, applicable to a wide range of experiments involving composition analysis of protein mixtures. Currently only ∼10-20% of MS/MS spectral data lead to the successful peptide identifications, and the rate of false positives remains to be high. We propose probability profile method (PPM) as a new route for the development of MS/MS data analysis algorithms. The principal idea can be described as a probabilistic "labeling" of the individual peaks, or as a detailed analysis of the spectra leading to peak separation into specific categories (b-ion, y-ion, double charged b-ion, etc). PPM "assignments", conducted on large and diverse data sets (∼60,000 spectra), indicate that a large majority of MS/MS peaks can be identified with a surprising level of confidence, providing the foundation for a range of novel algorithmic approaches: spectra can be edited by selecting desirable peak categories; overall characteristics of MS/MS spectra, such as parent ion charge or total number of the present ions, can be rapidly estimated with a high precision; labeled peaks of the same category (e.g. b-ions) can be efficiently connected into de novo tag peptides.
  • Keywords
    biology computing; classification; data analysis; genetics; mass spectra; molecular biophysics; probability; proteins; data analysis; de novo tag peptides; false positives; peptide identifications; probability profile method; protein; proteomics technologies; tandem mass spectrometry; Cells (biology); Chemical technology; Computer science; Data analysis; Databases; Mass spectroscopy; Peptides; Proteins; Proteomics; Satellites;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
  • Print_ISBN
    0-7695-2194-0
  • Type

    conf

  • DOI
    10.1109/CSB.2004.1332475
  • Filename
    1332475