DocumentCode
2548203
Title
Efficient Processing of Complex Twig Pattern Matching
Author
Zhu, Jinqing ; Wang, Wei ; Meng, Xiaofeng
Author_Institution
Renmin Univ. of China, Beijing
fYear
2008
fDate
20-22 July 2008
Firstpage
135
Lastpage
140
Abstract
As a de facto standard for information representation and exchange over the Internet, XML is used extensively in many applications, and XML query technology has attracted growing attention in the data management research community. Standard XML query languages, e.g. XPath and XQuery, use the twig pattern as the basic unit for matching relevant fragments of a given XML document. However, most existing work considers only simple containment relationships in the twig pattern, which limits its applicability in many cases. In this paper, we extend the original twig pattern to the complex twig pattern (CTP), which may contain ordered relationships between query nodes. We give a detailed analysis of the difficulties that stand in the way of an efficient solution for CTP matching, and then propose a novel holistic join algorithm, LBHJ, to handle CTPs efficiently and effectively. Experimental results show that LBHJ can substantially reduce the size of intermediate results and thus significantly improve query performance, according to various metrics, when processing CTPs with ordered axes.
Keywords
Internet; XML; electronic data interchange; information retrieval; query languages; XML query languages; XML query technology; XPath; XQuery; complex twig pattern matching; data management; information exchange; information representation; Algorithm design and analysis; Data structures; Database languages; Guidelines; Information management; Pattern matching; Technology management; complex twig query; follow sibling
fLanguage
English
Publisher
IEEE
Conference_Titel
Web-Age Information Management, 2008. WAIM '08. The Ninth International Conference on
Conference_Location
Zhangjiajie, Hunan
Print_ISBN
978-0-7695-3185-4
Electronic_ISBN
978-0-7695-3185-4
Type
conf
DOI
10.1109/WAIM.2008.54
Filename
4597006