DocumentCode :
1591779
Title :
Selectivity estimation for joins using systematic sampling
Author :
Harangsri, Banchong ; Shepherd, John ; Ngu, Anne
Author_Institution :
Sch. of Comput. Sci. & Eng., New South Wales Univ., Sydney, NSW, Australia
fYear :
1997
Firstpage :
384
Lastpage :
389
Abstract :
Proposes a new approach to the estimation of join selectivity. The technique, which we have called “systematic sampling”, is a novel variant of the sampling-based approach. Systematic sampling works as follows. Given a relation R of N tuples, with a join attribute that can be accessed in ascending/descending order via an index, if n is the number of tuples to be sampled from R, select a tuple at random from the first k=[N/n] tuples of R and every kth tuple thereafter. We first develop a theoretical foundation for systematic sampling which suggests that the method gives a more representative sample than the traditional simple random sampling. Subsequent experimental analysis on a range of synthetic relations confirms that the quality of sample relations (participating in a join) yielded by systematic sampling is higher than those produced by the traditional simple random sampling. To ensure that the sample relations produced by the systematic sampling indeed assist in computation for more accurate join selectivities, we compare the systematic sampling with the most efficient simple random sampling called t-cross, using a variety of star joins and a variety of relation configurations. The results demonstrate that, with the same amount of sampling, the systematic sampling can provide considerably more accurate join selectivities than the t-cross sampling
Keywords :
database theory; relational algebra; statistical analysis; index; join attribute; join selectivity estimation; random sampling; relation configurations; star joins; systematic sampling; t-cross sampling; tuples; Australia; Computer science; Cost function; Data structures; Database systems; Query processing; Sampling methods; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database and Expert Systems Applications, 1997. Proceedings., Eighth International Workshop on
Conference_Location :
Toulouse
Print_ISBN :
0-8186-8147-0
Type :
conf
DOI :
10.1109/DEXA.1997.617318
Filename :
617318
Link To Document :
بازگشت