DocumentCode :
1961853
Title :
Analyzing range queries on spatial data
Author :
Jin, Ji ; An, Ning ; Sivasubramaniam, Anand
Author_Institution :
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
fYear :
2000
fDate :
2000
Firstpage :
525
Lastpage :
534
Abstract :
Analysis of range queries on spatial (multidimensional) data is both important and challenging. Most previous analysis attempts have made certain simplifying assumptions about the data sets and/or queries to keep the analysis tractable. As a result, they may no be universally applicable. This paper proposes a set of five analysis techniques to estimate the selectivity and number of index nodes accessed in serving a range query. The underlying philosophy behind these techniques is to maintain an auxiliary data structure called a density file, whose creation is a one-time cost, which can be quickly consulted when the query is given. The schemes differ in what information is kept in the density file, how it is maintained and how this information is looked up. It is shown that one of the proposed schemes, called “cumulative density” (CD), gives very accurate results (usually less then 5% error) using a diverse suite of point and rectangular data sets, that are uniform or skewed, and a wide range of query window parameters. The estimation takes a constant amount of time, which is typically lower than 1% of the time that it would take to execute the query, regardless of data set or query window parameters
Keywords :
database theory; query processing; spatial data structures; visual databases; auxiliary data structure; cumulative density; data lookup method; density file; index node access; information maintenance; multidimensional data; one-time cost; point data sets; query window parameters; range query analysis; rectangular data sets; selectivity estimation; skewed data sets; spatial data; uniform data sets; Application software; Computer science; Costs; Electronic switching systems; Geographic Information Systems; Image databases; Information retrieval; Multidimensional systems; Performance analysis; Spatial databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2000. Proceedings. 16th International Conference on
Conference_Location :
San Diego, CA
ISSN :
1063-6382
Print_ISBN :
0-7695-0506-6
Type :
conf
DOI :
10.1109/ICDE.2000.839451
Filename :
839451
Link To Document :
بازگشت