DocumentCode
1987522
Title
Identification of contaminants in proteomics mass spectrometry data
Author
Duncan, M. ; Fung, K. ; Wang, H. ; Yen, C. ; Cios, K.
Author_Institution
Colorado Univ. Health Sci. Center, USA
fYear
2003
fDate
11-14 Aug. 2003
Firstpage
409
Lastpage
410
Abstract
This paper discusses the identification of potential contaminants in mass spectrometry data derived from proteomic studies. Contaminant masses are usually submitted with valid peptide masses to the protein identification algorithms which can potentially lead to false positive results. In this paper we present an approach for the automatic identification of contaminant masses so that they can be removed prior to the submission of the peak list for protein identification. For this purpose we have developed an algorithm that clusters mass values. We calculate the frequencies of all masses and then identify possible contaminant masses. We propose that masses that occur with high frequency are contaminants. In our analysis of 78,384 masses derived from 3,029 proteins, we identify 16 possible contaminants. Of these 16, four are known trypsin autolysis peptides. Removing these contaminant masses from the database search will lead to more accurate and reliable protein identification.
Keywords
contamination; identification; mass spectroscopy; proteins; automatic contaminants identification; mass spectrometry; protein identification algorithms; proteomics; trypsin autolysis peptides; Bioinformatics; Mass spectroscopy; Proteomics;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics Conference, 2003. CSB 2003. Proceedings of the 2003 IEEE
Print_ISBN
0-7695-2000-6
Type
conf
DOI
10.1109/CSB.2003.1227348
Filename
1227348
Link To Document