• DocumentCode
    1405136
  • Title

    Determination of Glycan Structure from Tandem Mass Spectra

  • Author

    Böcker, Sebastian ; Kehr, Birte ; Rasche, Florian

  • Author_Institution
    Fac. for Math. & Comput. Sci., Dept. of Bioinf., Friedrich-Schiller-Univ. Jena, Jena, Germany
  • Volume
    8
  • Issue
    4
  • fYear
    2011
  • Firstpage
    976
  • Lastpage
    986
  • Abstract
    Glycans are molecules made from simple sugars that form complex tree structures. Glycans constitute one of the most important protein modifications and identification of glycans remains a pressing problem in biology. Unfortunately, the structure of glycans is hard to predict from the genome sequence of an organism. In this paper, we consider the problem of deriving the topology of a glycan solely from tandem mass spectrometry (MS) data. We study, how to generate glycan tree candidates that sufficiently match the sample mass spectrum, avoiding the combinatorial explosion of glycan structures. Unfortunately, the resulting problem is known to be computationally hard. We present an efficient exact algorithm for this problem based on fixed-parameter algorithmics that can process a spectrum in a matter of seconds. We also report some preliminary results of our method on experimental data, combining it with a preliminary candidate evaluation scheme. We show that our approach is fast in applications, and that we can reach very well de novo identification results. Finally, we show how to count the number of glycan topologies for a fixed size or a fixed mass. We generalize this result to count the number of (labeled) trees with bounded out degree, improving on results obtained using Pólya´s enumeration theorem.
  • Keywords
    macromolecules; mass spectra; mass spectroscopic chemical analysis; organic compounds; polymers; Pólya´s enumeration theorem; complex tree structures; fixed-parameter algorithmics; glycan identification; glycan structure determination; glycan topology; protein modifications; simple sugars; tandem mass spectra; tandem mass spectrometry data; Bioinformatics; Computational biology; Ions; Proteins; Spectroscopy; Topology; Computational mass spectrometry; counting trees.; exact algorithms; glycans; parameterized algorithms; Algorithms; Carbohydrate Conformation; Computational Biology; Databases, Factual; Polysaccharides; Tandem Mass Spectrometry;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2010.129
  • Filename
    5669255