DocumentCode
2543802
Title
Holistic twig joins based on Sketch Tree
Author
Ji, JunWen ; Wang, Yue ; Lei, Kai ; Wang, Tengjiao
Author_Institution
Key Lab. of High Confidence Software Technol., Peking Univ., Beijing, China
fYear
2012
fDate
29-31 May 2012
Firstpage
1191
Lastpage
1195
Abstract
Because XML is so widely used, twig query matching, which has been identified as a core operation in querying XML data, has become an active research topic. Although a number of algorithms have recently been proposed to process twig queries holistically, they can be further improved with the help of the DTD file. In this paper, we (1) propose the Sketch Tree structure, built according to the DTD schema, and use it to translate queries; (2) propose a new labeling scheme named Sketch Dewey, based on the Sketch Tree, that encodes not only the elements in the XML data but also the query; and (3) design an efficient path matching algorithm called PathCheck and a novel holistic twig join algorithm called TJumperJoin, both of which use the Sketch Dewey code. To answer a query, these algorithms only need to scan the elements of the leaf nodes in the query and, benefiting from the Sketch Dewey labeling scheme, they can use bitwise operations for acceleration. Finally, extensive experiments are performed to examine the efficiency and effectiveness of the proposed techniques.
Keywords
XML; query processing; tree data structures; DTD file; PathCheck; Sketch Dewey code; Sketch Dewey labeling scheme; TJumperJoin; XML data querying; bitwise operation; holistic twig join algorithm; path matching algorithm; sketch tree structure; twig query matching; Algorithm design and analysis; Databases; Finite element methods; Labeling; Pattern matching; Sections; Sketch Tree; twig join
fLanguage
English
Publisher
IEEE
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2012 9th International Conference on
Conference_Location
Sichuan
Print_ISBN
978-1-4673-0025-4
Type
conf
DOI
10.1109/FSKD.2012.6233872
Filename
6233872