Title :
PBiTree coding and efficient processing of containment joins
Author :
Wang, Wei ; Jiang, Haigeng ; Lu, Hongjun ; Yu, Jeffrey Xu
Author_Institution :
Dept. of Comput. Sci., Hong Kong Univ. of Sci. & Tech., China
Abstract :
We address issue related to containment join processing in tree-structured data such as XML documents. A containment join takes two sets of XML node elements as input and returns pairs of elements such that the containment relationship holds between them. While there are previous algorithms for processing containment joins, they require both element sets either sorted or indexed. We propose a novel and complete containment query processing framework based on a new coding scheme, PBiTree code. The PBiTree code allows us to determine the ancestor-descendant relationship between two elements from their PBiTree-based codes efficiently. We present algorithms in the framework that are optimized for various combinations of settings. In particular, the newly proposed partitioning based algorithms can process containment joins efficiently without sorting or indexes. Experimental results indicate that the containment join processing algorithms based on the proposed coding scheme outperform existing algorithms significantly.
Keywords :
XML; query processing; tree data structures; PBiTree coding; XML documents; XML node elements; ancestor-descendant relationship; containment join processing algorithm; query processing framework; tree-structured data; Data engineering;
Conference_Titel :
Data Engineering, 2003. Proceedings. 19th International Conference on
Print_ISBN :
0-7803-7665-X
DOI :
10.1109/ICDE.2003.1260808