• DocumentCode
    1502833
  • Title

    A hybrid estimator for selectivity estimation

  • Author

    Ling, Yibei ; Sun, Wei ; Rishe, Naphtali D. ; Xiang, Xianjing

  • Author_Institution
    Commun. Res. Lab., Bellcore, Morristown, NJ, USA
  • Volume
    11
  • Issue
    2
  • fYear
    1999
  • Firstpage
    338
  • Lastpage
    354
  • Abstract
    Traditional sampling-based estimators infer the actual selectivity of a query based purely on runtime information gathering, excluding the previously collected information, which underutilizes the information available. Table-based and parametric estimators extrapolate the actual selectivity of a query based only on the previously collected information, ignoring online information, which results in inaccurate estimation in a frequently updated environment. We propose a novel hybrid estimator that utilizes and optimally combines the online and previously collected information. A theoretical analysis demonstrates that the online and previously collected information is complementary, and that the comprehensive utilization of the online and previously collected information is of value for further performance improvement. Our theoretical results are validated by a comprehensive experimental study using a practical database, in the presence of insert, delete and update operations. The hybrid approach is very promising in the sense that it provides an adaptive mechanism that allows the optimal combination of information obtained from different sources in order to achieve a higher estimation accuracy and reliability
  • Keywords
    database theory; estimation theory; online operation; query processing; adaptive mechanism; database operations; delete operation; estimation accuracy; estimation reliability; frequently updated environment; hybrid estimator; information utilization; insert operation; online information; optimal information combination; parametric estimators; performance improvement; previously collected information; query optimization; query selectivity estimation; runtime information gathering; sampling-based estimators; table-based estimators; update operation; Computer Society; Databases; Information analysis; Parametric statistics; Performance analysis; Query processing; Runtime; Sampling methods; Statistical distributions; Sun;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/69.761667
  • Filename
    761667