Title :
Structural Entropic Difference: A Bounded Distance Metric for Unordered Trees
Author :
Connor, Richard ; Simeoni, Fabio ; Iakovos, Michael
Author_Institution :
Dept. of Comput. & Inf. Sci., Univ. of Strathclyde, Glasgow, UK
Abstract :
We show a new metric for comparing unordered, tree-structured data. While such data is increasingly important in its own right, the methodology underlying the construction of the metric is generic and may be reused for other classes of ordered and partially ordered data. The metric is based on the information content of the two values under consideration, which is measured using Shannon´s entropy equations. In essence, the more commonality the values possess, the closer they are. As values in this domain may have no commonality, a good metric should be bounded to represent this. This property has been achieved, but is in tension with triangle inequality.
Keywords :
tree data structures; bounded distance metric; partially ordered data; structural entropic difference; tree-structured data; Application software; Entropy; Equations; Extraterrestrial measurements; Testing; Tree data structures; XML; distance metric; entropy; information content; information distance; tree comparison; unordered tree;
Conference_Titel :
Similarity Search and Applications, 2009. SISAP '09. Second International Workshop on
Conference_Location :
Prague
Print_ISBN :
978-0-7695-3765-8
DOI :
10.1109/SISAP.2009.29