• DocumentCode
    140896
  • Title

    A general algorithm for subtree similarity-search

  • Author

    Cohen, Sholom ; Or, Nerya

  • Author_Institution
    Rachel & Selim Benin Sch. of Comput. Sci. & Eng., Hebrew Univ. of Jerusalem, Jerusalem, Israel
  • fYear
    2014
  • fDate
    March 31 2014-April 4 2014
  • Firstpage
    928
  • Lastpage
    939
  • Abstract
    Determining similarity between trees is an important problem in a variety of areas. The subtree similarity-search problem is that of finding, given a tree Q and a large set of trees Γ = {T1; ...; Tn}, the subtrees of trees among Γ that are most similar to Q. Similarity is defined using some tree distance function. While subtree similarity-search has been studied in the past, solutions mostly focused on specific tree distance functions, and were usually applicable only to ordered trees. This paper presents an efficient new algorithm that solves the subtree similarity-search problem, and is compatible with a wide family of tree distance functions (for both ordered and unordered trees). Extensive experimentation confirms the efficiency and scalability of the algorithm, which displays consistently good runtime even for large queries and datasets.
  • Keywords
    computational complexity; tree searching; NP-complete; subtree similarity-search problem; tree distance function; unordered trees; Context; Databases; Heuristic algorithms; Polynomials; Runtime; TV; Vegetation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2014 IEEE 30th International Conference on
  • Conference_Location
    Chicago, IL
  • Type

    conf

  • DOI
    10.1109/ICDE.2014.6816712
  • Filename
    6816712