DocumentCode
3268216
Title
Selectivity estimation for string predicates: overcoming the underestimation problem
Author
Chaudhuri, Surajit ; Ganti, Venkatesh ; Gravano, Luis
fYear
2004
fDate
30 March-2 April 2004
Firstpage
227
Lastpage
238
Abstract
Queries with (equality or LIKE) selection predicates over string attributes are widely used in relational databases. However, state-of-the-art techniques for estimating selectivities of string predicates are often biased towards severely underestimating selectivities. We develop accurate selectivity estimators for string predicates that adapt to data and query characteristics, and which can exploit and build on a variety of existing estimators. A thorough experimental evaluation over real data sets demonstrates the resilience of our estimators to variations in both data and query characteristics.
Keywords
query processing; relational databases; data characteristics; query characteristics; real data set; relational databases; selection predicate; selectivity estimation; state-of-the-art technique; string predicates; underestimation problem; Frequency estimation; Relational databases; Resilience; State estimation; Statistics;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2004. Proceedings. 20th International Conference on
ISSN
1063-6382
Print_ISBN
0-7695-2065-0
Type
conf
DOI
10.1109/ICDE.2004.1319999
Filename
1319999
Link To Document