DocumentCode
1160172
Title
Toward an accurate analysis of range queries on spatial data
Author
An, Ning ; Jin, Ji ; Sivasubramaniam, Anand
Author_Institution
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
Volume
15
Issue
2
fYear
2003
Firstpage
305
Lastpage
323
Abstract
Analysis of range queries on spatial (multidimensional) data is both important and challenging. Most previous analysis attempts have made certain simplifying assumptions about the data sets and/or queries to keep the analysis tractable. As a result, they may not be universally applicable. This paper proposes a set of five analysis techniques to estimate the selectivity and number of index nodes accessed in serving a range query. The underlying philosophy behind these techniques is to maintain an auxiliary data structure, called a density file, whose creation is a one-time cost, which can be quickly consulted when the query is given. The schemes differ in what information is kept in the density file, how it is maintained, and how this information is looked up. It is shown that one of the proposed schemes, called cumulative density (CD), gives very accurate results (usually less than 5 percent error) using a diverse suite of point and rectangular data sets, that are uniform or skewed, and a wide range of query window parameters. The estimation takes a constant amount of time, which is typically lower than 1 percent of the time that it would take to execute the query, regardless of data set or query window parameters.
Keywords
query processing; visual databases; auxiliary data structure; cumulative density; density file; index nodes; multidimensional data; point data sets; query window parameters; range query; range query analysis; rectangular data sets; selectivity; spatial data; Application software; Costs; Data structures; Image databases; Information retrieval; Information systems; Multidimensional systems; Parameter estimation; Performance analysis; Spatial databases;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2003.1185836
Filename
1185836
Link To Document