DocumentCode :
2369601
Title :
iMOWSE, a scoring scheme bridging in silico and in vitro digestion in peptide mass fingerprints
Author :
Tsui, Stephen Kwok-wing ; Leung, Ka-Kit
Author_Institution :
Sch. of Biomed. Sci., Chinese Univ. of Hong Kong, Hong Kong, China
fYear :
2009
fDate :
1-4 Nov. 2009
Firstpage :
344
Lastpage :
344
Abstract :
Peptide mass fingerprinting is a popular protein identification method. Presence of false positives and inability to identify a protein remain a challenge. Despite much effort in tackling the problems, little attention is paid to the intrinsic problematic methodology adopted - tryptic digestion followed by identification of the experimentally available peptides. Here, we demonstrated an unexplored issue - cleavage site clusters and formulated a scoring function, iMOWSE to distinguish true positives from false positives. We first showed how the cleavage site clusters skew peptide database composition. Uneven distribution among proteins and peptides results in biased scoring. A protein falls in a group possessing a large amount of short peptides has a higher chance to score higher. This effect cannot be removed by simply limiting database search within a mass range of common experimentally obtainable peptides. Neither does artificially multiplying the factors of such short peptide matches solve the problem. Concerning the information content of an in silico digestion of a protein entry, both the number of matches of long peptides (>800 Da) and the number of such long peptides an entry can generate are taken into account. The resultant scoring function iMOWSE is robust in detecting true positives in our model database, which has a similar composition distribution to common databases msdb and nr. Moreover, it also accommodates better with situations that allow larger tolerance of deviation of query mass values from theoretical mass values; and fewer number of matches are needed to identify true positives.
Keywords :
biology computing; proteins; cleavage site clusters skew peptide database composition; iMOWSE; in silico digestion; in vitro digestion; peptide mass fingerprints; protein identification method; scoring scheme; Bioinformatics; Databases; Fingerprint recognition; In vitro; Peptides; Proteins; Robustness;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine Workshop, 2009. BIBMW 2009. IEEE International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4244-5121-0
Type :
conf
DOI :
10.1109/BIBMW.2009.5332092
Filename :
5332092
Link To Document :
بازگشت