• DocumentCode
    3268216
  • Title

    Selectivity estimation for string predicates: overcoming the underestimation problem

  • Author

    Chaudhuri, Surajit ; Ganti, Venkatesh ; Gravano, Luis

  • fYear
    2004
  • fDate
    30 March-2 April 2004
  • Firstpage
    227
  • Lastpage
    238
  • Abstract
    Queries with (equality or LIKE) selection predicates over string attributes are widely used in relational databases. However, state-of-the-art techniques for estimating selectivities of string predicates are often biased towards severely underestimating selectivities. We develop accurate selectivity estimators for string predicates that adapt to data and query characteristics, and which can exploit and build on a variety of existing estimators. A thorough experimental evaluation over real data sets demonstrates the resilience of our estimators to variations in both data and query characteristics.
  • Keywords
    query processing; relational databases; data characteristics; query characteristics; real data set; relational databases; selection predicate; selectivity estimation; state-of-the-art technique; string predicates; underestimation problem; Frequency estimation; Relational databases; Resilience; State estimation; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2004. Proceedings. 20th International Conference on
  • ISSN
    1063-6382
  • Print_ISBN
    0-7695-2065-0
  • Type

    conf

  • DOI
    10.1109/ICDE.2004.1319999
  • Filename
    1319999