• DocumentCode
    2369601
  • Title

    iMOWSE, a scoring scheme bridging in silico and in vitro digestion in peptide mass fingerprints

  • Author

    Tsui, Stephen Kwok-wing ; Leung, Ka-Kit

  • Author_Institution
    Sch. of Biomed. Sci., Chinese Univ. of Hong Kong, Hong Kong, China
  • fYear
    2009
  • fDate
    1-4 Nov. 2009
  • Firstpage
    344
  • Lastpage
    344
  • Abstract
    Peptide mass fingerprinting is a popular protein identification method. Presence of false positives and inability to identify a protein remain a challenge. Despite much effort in tackling the problems, little attention is paid to the intrinsic problematic methodology adopted - tryptic digestion followed by identification of the experimentally available peptides. Here, we demonstrated an unexplored issue - cleavage site clusters and formulated a scoring function, iMOWSE to distinguish true positives from false positives. We first showed how the cleavage site clusters skew peptide database composition. Uneven distribution among proteins and peptides results in biased scoring. A protein falls in a group possessing a large amount of short peptides has a higher chance to score higher. This effect cannot be removed by simply limiting database search within a mass range of common experimentally obtainable peptides. Neither does artificially multiplying the factors of such short peptide matches solve the problem. Concerning the information content of an in silico digestion of a protein entry, both the number of matches of long peptides (>800 Da) and the number of such long peptides an entry can generate are taken into account. The resultant scoring function iMOWSE is robust in detecting true positives in our model database, which has a similar composition distribution to common databases msdb and nr. Moreover, it also accommodates better with situations that allow larger tolerance of deviation of query mass values from theoretical mass values; and fewer number of matches are needed to identify true positives.
  • Keywords
    biology computing; proteins; cleavage site clusters skew peptide database composition; iMOWSE; in silico digestion; in vitro digestion; peptide mass fingerprints; protein identification method; scoring scheme; Bioinformatics; Databases; Fingerprint recognition; In vitro; Peptides; Proteins; Robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine Workshop, 2009. BIBMW 2009. IEEE International Conference on
  • Conference_Location
    Washington, DC
  • Print_ISBN
    978-1-4244-5121-0
  • Type

    conf

  • DOI
    10.1109/BIBMW.2009.5332092
  • Filename
    5332092