• DocumentCode
    2831464
  • Title

    An Improved Clique-Based Method for Computing Edit Distance between Unordered Trees and Its Application to Comparison of Glycan Structures

  • Author

    Akutsu, Tatsuya ; Mori, Tomoya ; Tamura, Takeyuki ; Fukagawa, Daiji ; Takasu, Atsuhiro ; Tomita, Etsuji

  • Author_Institution
    Bioinf. Center, Kyoto Univ., Kyoto, Japan
  • fYear
    2011
  • fDate
    June 30 2011-July 2 2011
  • Firstpage
    536
  • Lastpage
    540
  • Abstract
    The tree edit distance is one of the most widely used measures for comparison of tree structured data and has been used for analysis of RNA secondary structures, glycan structures, and vascular trees. However, it is known that the tree edit distance problem is NP-hard for unordered trees while it is polynomial time solvable for ordered trees. We have recently proposed a clique-based method for computing the tree edit distance between unordered trees in which each instance of the tree edit distance problem is transformed into an instance of the maximum vertex weighted clique problem and then an existing clique algorithm is applied. In this paper, we propose an improved clique-based method. Different from our previous method, the improved method is basically a dynamic programming algorithm that repeatedly solves instances of the maximum vertex weighted clique problem as sub-problems. Other heuristic techniques, which do not violate the optimality of the solution, are also introduced. When applied to comparison of large glycan structures, our improved method showed significant speed-up in most cases.
  • Keywords
    bioinformatics; computational complexity; dynamic programming; macromolecules; trees (mathematics); Glycan structures; NP-hard; RNA secondary structures; dynamic programming algorithm; improved clique-based method; maximum vertex weighted clique problem; polynomial time; tree edit distance; unordered trees; Algorithm design and analysis; Approximation algorithms; Bioinformatics; Computer science; Dynamic programming; Heuristic algorithms; RNA; dynamic programming; glycan; maximum clique; tree edit distance; unordered trees;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Complex, Intelligent and Software Intensive Systems (CISIS), 2011 International Conference on
  • Conference_Location
    Seoul
  • Print_ISBN
    978-1-61284-709-2
  • Electronic_ISBN
    978-0-7695-4373-4
  • Type

    conf

  • DOI
    10.1109/CISIS.2011.88
  • Filename
    5989066