• DocumentCode
    1484547
  • Title

    Antilope—A Lagrangian Relaxation Approach to the de novo Peptide Sequencing Problem

  • Author

    Andreotti, S. ; Klau, G.W. ; Reinert, K.

  • Author_Institution
    Inst. fur Inf., Freie Univ. Berlin, Berlin, Germany
  • Volume
    9
  • Issue
    2
  • fYear
    2012
  • Firstpage
    385
  • Lastpage
    394
  • Abstract
    Peptide sequencing from mass spectrometry data is a key step in proteome research. Especially de novo sequencing, the identification of a peptide from its spectrum alone, is still a challenge even for state-of-the-art algorithmic approaches. In this paper, we present antilope, a new fast and flexible approach based on mathematical programming. It builds on the spectrum graph model and works with a variety of scoring schemes. ANTILOPE combines Lagrangian relaxation for solving an integer linear programming formulation with an adaptation of Yen´s k shortest paths algorithm. It shows a significant improvement in running time compared to mixed integer optimization and performs at the same speed like other state-of-the-art tools. We also implemented a generic probabilistic scoring scheme that can be trained automatically for a data set of annotated spectra and is independent of the mass spectrometer type. Evaluations on benchmark data show that antilope is competitive to the popular state-of-the-art programs PepNovo and NovoHMM both in terms of runtime and accuracy. Furthermore, it offers increased flexibility in the number of considered ion types. ANTILOPE will be freely available as part of the open source proteomics library OpenMS.
  • Keywords
    biology computing; linear programming; mass spectroscopy; molecular biophysics; proteins; proteomics; Lagrangian relaxation approach; NovoHMM; OpenMS; PepNovo; antilope; de novo peptide sequencing; integer linear programming; mass spectrometry; open source proteomics library; proteome research; scoring schemes; spectrum graph model; Algorithm design and analysis; Amino acids; Computational biology; Databases; Heuristic algorithms; Optimization; Peptides; Computational proteomics; Lagrangian relaxation; de novo peptide sequencing; discrete optimization.; Algorithms; Mass Spectrometry; Models, Theoretical; Peptides; Proteomics; Sequence Analysis, Protein;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2011.59
  • Filename
    5740842