Title :
A stream-based selectivity estimation technique for forward XPath
Author :
Alrammal, Muath ; Hains, Gaétan
Author_Institution :
LACL (Lab. d´´Algorithmique, Complexite et Logique), Univ. Paris-Est, Marne-la-Vallée, France
Abstract :
The Extensible Markup Language (XML) rapidly establishes itself as the de facto standard for presenting, storing, and exchanging data on the Internet. However, querying large volume of XML data represents a bottleneck for several computationally intensive applications. A fast and accurate selectivity estimation mechanism is of practical importance because selectivity estimation plays a fundamental role in XML query performance. Recently proposed techniques are all based on some forms of structure synopses that could be time-consuming to build and not effective for summarizing complex structure relationships. To overcome this limitation, we propose an innovative selectivity estimation algorithm, which consists of (1) the path tree synopsis data structure, a succinct description of the original document with low computational overhead and high accuracy for processing tasks like selectivity estimation, (2) a streaming selectivity estimation algorithm which is efficient for path tree traversal. Extensive experiments on both real and synthetic data sets show that our technique achieves better accuracy and less construction time than existing approaches.
Keywords :
Internet; XML; electronic data interchange; query processing; tree data structures; Internet; complex structure; data exchange; data presentation; data set; data storage; de facto standard; extensible markup language; forward XPath; path tree synopsis data structure; path tree traversal; query processing; streaming selectivity estimation algorithm; task processing; Accuracy; Data structures; Estimation; Grammar; Impedance matching; Internet; XML;
Conference_Titel :
Innovations in Information Technology (IIT), 2012 International Conference on
Conference_Location :
Abu Dhabi
Print_ISBN :
978-1-4673-1100-7
DOI :
10.1109/INNOVATIONS.2012.6207734