DocumentCode
1502833
Title
A hybrid estimator for selectivity estimation
Author
Ling, Yibei ; Sun, Wei ; Rishe, Naphtali D. ; Xiang, Xianjing
Author_Institution
Commun. Res. Lab., Bellcore, Morristown, NJ, USA
Volume
11
Issue
2
fYear
1999
Firstpage
338
Lastpage
354
Abstract
Traditional sampling-based estimators infer the actual selectivity of a query based purely on runtime information gathering, excluding the previously collected information, which underutilizes the information available. Table-based and parametric estimators extrapolate the actual selectivity of a query based only on the previously collected information, ignoring online information, which results in inaccurate estimation in a frequently updated environment. We propose a novel hybrid estimator that utilizes and optimally combines the online and previously collected information. A theoretical analysis demonstrates that the online and previously collected information is complementary, and that the comprehensive utilization of the online and previously collected information is of value for further performance improvement. Our theoretical results are validated by a comprehensive experimental study using a practical database, in the presence of insert, delete and update operations. The hybrid approach is very promising in the sense that it provides an adaptive mechanism that allows the optimal combination of information obtained from different sources in order to achieve a higher estimation accuracy and reliability
Keywords
database theory; estimation theory; online operation; query processing; adaptive mechanism; database operations; delete operation; estimation accuracy; estimation reliability; frequently updated environment; hybrid estimator; information utilization; insert operation; online information; optimal information combination; parametric estimators; performance improvement; previously collected information; query optimization; query selectivity estimation; runtime information gathering; sampling-based estimators; table-based estimators; update operation; Computer Society; Databases; Information analysis; Parametric statistics; Performance analysis; Query processing; Runtime; Sampling methods; Statistical distributions; Sun;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/69.761667
Filename
761667
Link To Document