• DocumentCode
    3073948
  • Title

    Generating Peptide Sequence Tags for Peptide Identification via Tandem Mass Spectrometry

  • Author

    Yu, Changyong ; Wang, Guoren ; Zhao, Yuhai ; Mao, Keming ; Wu, Junjie ; Zhai, Wendan

  • Author_Institution
    Key Lab. of Med. Image Comput., Northeastern Univ., Shenyang, China
  • fYear
    2009
  • fDate
    22-24 June 2009
  • Firstpage
    200
  • Lastpage
    207
  • Abstract
    Large-scale, rapid and accurate protein identification is the crucial basis for further protein analysis in computational proteomics. Searching protein database by use of the protein tandem mass spectra has been a standard solution for solving this problem. Though several algorithms have been proposed, more sensitive and accurate approaches are still needed. In this paper, an effective database search approach is proposed. Prior to searching sequence database, an approach based on a graph-theoretic model is proposed to infer the peptide sequence tag (PST) from the tandem mass spectra data which is the partial sequence of the peptide. Also, an index approach for the protein sequence database is proposed for speeding up the database search and filtering out the incorrect protein sequences. Then, a novel scoring method for evaluating the match between the peptide sequence tag and the protein sequence is proposed for improving the accuracy of the database search result. Finally, we develop an algorithm for solving the problem and implement it as a computer program PepCheck. All the results fore-Check are compared with those of the famous algorithms. Experimental results demonstrate that PepCheck is as accurate as or more accurate than them with the test datasets.
  • Keywords
    bioinformatics; biological techniques; database management systems; graph theory; mass spectroscopy; proteins; proteomics; computational proteomics; computer program PepCheck; graph-theoretic model; peptide identification; peptide sequence tag generation; protein database search; protein identification; protein tandem mass spectra; tandem mass spectrometry; Amino acids; Bioinformatics; Biomedical engineering; Databases; Large-scale systems; Mass spectroscopy; Peptides; Protein sequence; Proteomics; Spine; Database search; Peptide sequence tag; Protein identification; Tandem mass spectra;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and BioEngineering, 2009. BIBE '09. Ninth IEEE International Conference on
  • Conference_Location
    Taichung
  • Print_ISBN
    978-0-7695-3656-9
  • Type

    conf

  • DOI
    10.1109/BIBE.2009.16
  • Filename
    5211281