Title :
Holistic twig joins based on Sketch Tree
Author :
Ji, JunWen ; Wang, Yue ; Lei, Kai ; Wang, Tengjiao
Author_Institution :
Key Lab. of High Confidence Software Technol., Peking Univ., Beijing, China
Abstract :
Because XML is widely used, twig query matching, which has been identified as a core operation in querying XML data, has become an active research topic. Although a number of algorithms have recently been proposed to process a twig query holistically, existing algorithms can be further improved with the help of the DTD file. In this paper, we (1) propose the Sketch Tree structure, derived from the DTD schema, and employ it to translate queries; (2) propose a new labeling scheme named Sketch Dewey, built on the Sketch Tree, that encodes not only the elements in the XML data but also the query; and (3) design an efficient path matching algorithm called PathCheck and a novel holistic twig join algorithm called TJumperJoin, both of which use the Sketch Dewey code. To answer a query, the two algorithms need to scan only the elements corresponding to the leaf nodes of the query, and, thanks to the Sketch Dewey labeling scheme, they can use bitwise operations to speed up processing. Finally, extensive experiments are performed to examine the efficiency and effectiveness of the proposed techniques.
Keywords :
XML; query processing; tree data structures; DTD file; PathCheck; Sketch Dewey code; Sketch Dewey labeling scheme; TJumperJoin; XML data querying; bitwise operation; holistic twig join algorithm; path matching algorithm; sketch tree structure; twig query matching; Algorithm design and analysis; Databases; Finite element methods; Labeling; Pattern matching; Sections; Sketch Tree; twig join
Conference_Title :
Fuzzy Systems and Knowledge Discovery (FSKD), 2012 9th International Conference on
Conference_Location :
Sichuan
Print_ISBN :
978-1-4673-0025-4
DOI :
10.1109/FSKD.2012.6233872