DocumentCode :
3409365
Title :
Probability profile method - new approach to data analysis in tandem mass spectrometry
Author :
Gorin, Andrey ; Day, Robert M. ; Borziak, A. ; Strader, Michael B. ; Hurst, Gregory B. ; Fridman, Terna
Author_Institution :
Div. of Comput. Sci. & Mathematics, Oak Ridge Nat. Lab., TN, USA
fYear :
2004
fDate :
16-19 Aug. 2004
Firstpage :
499
Lastpage :
502
Abstract :
Tandem mass spectrometry (MS/MS) is one of the leading proteomics technologies, applicable to a wide range of experiments involving composition analysis of protein mixtures. Currently only ∼10-20% of MS/MS spectral data lead to the successful peptide identifications, and the rate of false positives remains to be high. We propose probability profile method (PPM) as a new route for the development of MS/MS data analysis algorithms. The principal idea can be described as a probabilistic "labeling" of the individual peaks, or as a detailed analysis of the spectra leading to peak separation into specific categories (b-ion, y-ion, double charged b-ion, etc). PPM "assignments", conducted on large and diverse data sets (∼60,000 spectra), indicate that a large majority of MS/MS peaks can be identified with a surprising level of confidence, providing the foundation for a range of novel algorithmic approaches: spectra can be edited by selecting desirable peak categories; overall characteristics of MS/MS spectra, such as parent ion charge or total number of the present ions, can be rapidly estimated with a high precision; labeled peaks of the same category (e.g. b-ions) can be efficiently connected into de novo tag peptides.
Keywords :
biology computing; classification; data analysis; genetics; mass spectra; molecular biophysics; probability; proteins; data analysis; de novo tag peptides; false positives; peptide identifications; probability profile method; protein; proteomics technologies; tandem mass spectrometry; Cells (biology); Chemical technology; Computer science; Data analysis; Databases; Mass spectroscopy; Peptides; Proteins; Proteomics; Satellites;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
Print_ISBN :
0-7695-2194-0
Type :
conf
DOI :
10.1109/CSB.2004.1332475
Filename :
1332475
Link To Document :
بازگشت