DocumentCode :
2019259
Title :
All Common Embedded Subtrees for Measuring Tree Similarity
Author :
Lin, Zhiwei ; Wang, Hui ; Mcclean, Sally ; Liu, Chang
Author_Institution :
Fac. of Comput. & Eng., Univ. of Ulster, Belfast
Volume :
1
fYear :
2008
fDate :
17-18 Oct. 2008
Firstpage :
29
Lastpage :
32
Abstract :
Tree similarity measurement is key to tree-like data mining. In order to maximally capture common information between trees, we consider the problem of computing all common embedded subtrees, and advocate using the number/count of all common embedded subtrees as a measure of similarity. This problem is not trivial due to the inherent complexity of trees and the ensued large search space. The problem is theoretically analyzed and an effective algorithm for counting all common embedded subtrees is presented. Experimental evaluation shows that the all common embedded subtree similarity is very competitive against tree edit distance, in terms of both efficiency and effectiveness.
Keywords :
computational complexity; data mining; search problems; tree data structures; trees (mathematics); all common embedded subtree; computational complexity; data mining; search space; tree similarity measurement; Algorithm design and analysis; Computational intelligence; Data engineering; Data mining; Design engineering; Embedded computing; Polynomials; Sequences; Text mining; Tree data structures; all common embedded subtree; tree edit distance; tree similarity measurement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Design, 2008. ISCID '08. International Symposium on
Conference_Location :
Wuhan
Print_ISBN :
978-0-7695-3311-7
Type :
conf
DOI :
10.1109/ISCID.2008.20
Filename :
4725550
Link To Document :
بازگشت