Title :
Error minimization for approximate computation of range aggregates
Author :
Lin, Xuemin ; Zhang, Qing
Author_Institution :
Sch. of Comput. Sci. & Eng., New South Wales Univ., Sydney, NSW, Australia
Abstract :
Histogram techniques are widely used in commercial database management systems for an estimation of query results. Recently, they have been also used in approximately, processing database queries, especially aggregation queries. Existing research results in this area have been mainly focused on constructing a histogram to approximately represent, as accurate as possible on an intuitive base, the original data frequencies. We propose a novel histogram construction method aiming to minimize the average approximate aggregation errors; and we have developed an efficient algorithm to construct near optimal histograms to achieve this goal. Our experiment results showed that the new histogram construction techniques lead to more accurate results than those by existing histogram techniques, and also out-perform the existing wavelet techniques.
Keywords :
database management systems; minimisation; query processing; aggregation queries; approximate computation; commercial database management systems; error minimization; experiment; histogram techniques; near optimal histograms; query processing; range aggregates; wavelet techniques; Aggregates; Australia; Computer errors; Computer science; Data engineering; Database systems; Frequency; Histograms; Query processing; Sampling methods;
Conference_Titel :
Database Systems for Advanced Applications, 2003. (DASFAA 2003). Proceedings. Eighth International Conference on
Conference_Location :
Kyoto, Japan
Print_ISBN :
0-7695-1895-8
DOI :
10.1109/DASFAA.2003.1192380