• DocumentCode
    33917
  • Title

    An Accurate de novo Algorithm for Glycan Topology Determination from Mass Spectra

  • Author

    Liang Dong ; Bing Shi ; Guangdong Tian ; YanBo Li ; Bing Wang ; Mengchu Zhou

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
  • Volume
    12
  • Issue
    3
  • fYear
    2015
  • fDate
    May-June 1 2015
  • Firstpage
    568
  • Lastpage
    578
  • Abstract
    Determining the glycan topology automatically from mass spectra represents a great challenge. Existing methods fall into approximate and exact ones. The former including greedy and heuristic ones can reduce the computational complexity, but suffer from information lost in the procedure of glycan interpretation. The latter including dynamic programming and exhaustive enumeration are much slower than the former. In the past years, nearly all emerging methods adopted a tree structure to represent a glycan. They share such problems as repetitive peak counting in reconstructing a candidate structure. Besides, tree-based glycan representation methods often have to give different computational formulas for binary and ternary glycans. We propose a new directed acyclic graph structure for glycan representation. Based on it, this work develops a de novo algorithm to accurately reconstruct the tree structure iteratively from mass spectra with logical constraints and some known biosynthesis rules, by a single computational formula. The experiments on multiple complex glycans extracted from human serum show that the proposed algorithm can achieve higher accuracy to determine a glycan topology than prior methods without increasing computational burden.
  • Keywords
    bioinformatics; computational complexity; graph theory; mass spectra; molecular biophysics; molecular configurations; trees (mathematics); biosynthesis rules; computational burden; computational complexity; de novo algorithm; directed acyclic graph structure; glycan representation; glycan topology determination; human serum; iterative tree structure reconstruction; logical constraints; mass spectra; multiple complex glycans; single computational formula; Computational biology; Couplings; Databases; IEEE transactions; Proteins; Silicon; Topology; Denovo; MS/MS; Tandem mass spectrometry (MS/MS); algorithm; de novo algorithm; glycan topology interpretation; mass spectra; optimization;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2014.2368981
  • Filename
    6951360