DocumentCode :
315611
Title :
A cost model for estimating the performance of spatial joins using R-trees
Author :
Huang, Yun-Wu ; Jing, Ning ; Rundensteiner, Elke A.
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
fYear :
1997
fDate :
11-13 Aug 1997
Firstpage :
30
Lastpage :
38
Abstract :
The development of a cost model for predicting the performance of spatial joins has been identified in the literature as an important and difficult problem. The authors present the first cost model that can predict the performance of spatial joins using R-trees. Based on two existing R-trees (join targets), the model first estimates the number of expected I/Os for the join process by assuming a zero buffer size. The method for this estimation extends the cost model for R-tree window queries (developed by Kamel and Faloutsos (1993) and by Pagel et al. (1993)) to also handle spatial joins (which are more complex). In the context of spatial join processing, this number of zero-buffer expected I/Os is not practical for performance prediction in a buffered environment. To model the buffer impact, they use an (exponential) distribution function to measure the probability that a bufferless I/O would cause a page fault in a buffered environment. Based on this probability and the zero-buffer expected I/O cost, the estimated number of I/Os for an R-tree join can then be computed. The comparisons between the predictions from the cost model and the actual results from the experiments based on real GIS maps show that the average relative error ratio is about 10% with a maximum of about 20% for a wide range of buffer sizes. Therefore, our model is a useful tool for the query optimization of spatial join queries
Keywords :
costing; database theory; geographic information systems; input-output programs; probability; query processing; spatial data structures; tree data structures; visual databases; GIS maps; R-tree window queries; R-trees; average relative error ratio; buffer impact; cost model; distribution function; expected I/Os; page fault; probability; query optimization; spatial join performance estimation; spatial join queries; zero buffer size; Costs; Filters; Geographic Information Systems; Image databases; Image processing; Predictive models; Query processing; Spatial databases; US Department of Transportation; Very large scale integration;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Scientific and Statistical Database Management, 1997. Proceedings., Ninth International Conference on
Conference_Location :
Olympia, WA
Print_ISBN :
0-8186-7952-2
Type :
conf
DOI :
10.1109/SSDM.1997.621148
Filename :
621148
Link To Document :
بازگشت