DocumentCode :
945828
Title :
Materialized Sample Views for Database Approximation
Author :
Joshi, Shantanu ; Jermaine, Christopher
Author_Institution :
Oracle Corp., Redwood
Volume :
20
Issue :
3
fYear :
2008
fDate :
3/1/2008 12:00:00 AM
Firstpage :
337
Lastpage :
351
Abstract :
We consider the problem of creating a sample view of a database table. A sample view is an indexed materialized view that permits efficient sampling from an arbitrary range query over the view. Such "sample views" are very useful in applications that require random samples from a database: approximate query processing, online aggregation, data mining, and randomized algorithms are a few examples. Our core technical contribution is a new file organization called the appendability, combinability, and exponentiality (ACE) tree that is suitable for organizing and indexing a sample view. One of the most important aspects of the ACE tree is that it supports online random sampling from the view. That is, at all times, the set of records returned by the ACE tree constitutes a statistically random sample of the database records satisfying the relational selection predicate over the view. Our paper presents experimental results that demonstrate the utility of the ACE tree.
Keywords :
database management systems; file organisation; query processing; appendability combinability and exponentiality tree; data mining; database table sample views; file organization; online aggregation; query processing; randomized algorithms; Indexing methods; Query processing; Sampling;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2007.190664
Filename :
4358950
Link To Document :
بازگشت