DocumentCode
1915041
Title
Pragmatic estimation of join sizes and attribute correlations
Author
Bell, D.A. ; McClean, Sally
Author_Institution
Inst. of Inf., Ulster Univ.
fYear
1989
fDate
6-10 Feb 1989
Firstpage
76
Lastpage
84
Abstract
A method is presented for modeling attribute value distributions in database relations for the purpose of obtaining accurate estimates of intermediate relation sizes during query evaluation. The basic idea is that instead of keeping a single (average) value to represent the number of occurrences of each attribute value, m (typically ten) parameters are kept, each representing the number of occurrences of attribute values in a piece, or partition, corresponding to a subrange of 1/m th of the original value range. The uniformity assumption, taken as an estimation technique rather than as an assumption, holds for each partition, hence the name piecewise uniform. The distribution method is extended to the modeling of important intrarelational attribute correlations. This and other enhancements to the technique such as application to semijoin operation are suggested. The technique is being used on two multidatabase management systems
Keywords
distributed databases; query languages; relational databases; attribute correlations; attribute value distributions; database relations; intrarelational attribute correlations; join sizes; multidatabase management systems; piecewise uniform; query evaluation; semijoin operation; uniformity assumption; Cost function; Data mining; Database languages; Database systems; Energy management; Informatics; Optimization methods; Power system management; Query processing; Relational databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 1989. Proceedings. Fifth International Conference on
Conference_Location
Los Angeles, CA
Print_ISBN
0-8186-1915-5
Type
conf
DOI
10.1109/ICDE.1989.47202
Filename
47202
Link To Document