DocumentCode :
3269883
Title :
A flexible infrastructure for gathering XML statistics and estimating query cardinality
Author :
Freire, Juliana ; Ramanath, Maya ; Zhang, Lingzhi
fYear :
2004
fDate :
30 March-2 April 2004
Firstpage :
857
Abstract :
A key component of XML data management systems is the result size estimator, which estimates the cardinalities of user queries. Estimated cardinalities are needed in a variety of tasks, including query optimization and cost-based storage design; and they can also be used to give users early feedback about the expected outcome of their queries. In contrast to previously proposed result estimators, which use specialized data structures and estimation algorithms, StatiX uses histograms to uniformly capture both the structural and value skew present in documents. The original version of StatiX was built as a proof of concept. With the goal of making the system publicly available, we have built StatiX++, a new and improved version of StatiX, which extends the original system in significant ways. In this demonstration, we show the key features of StatiX++.
Keywords :
XML; data structures; query processing; statistical databases; StatiX++ system; XML data management systems; XML statistics; cost-based storage design; data structures; estimation algorithms; histograms; publicly available system; query cardinality estimation; query optimization; result size estimator; Statistics; XML;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2004. Proceedings. 20th International Conference on
ISSN :
1063-6382
Print_ISBN :
0-7695-2065-0
Type :
conf
DOI :
10.1109/ICDE.2004.1320085
Filename :
1320085
Link To Document :
بازگشت