DocumentCode
945828
Title
Materialized Sample Views for Database Approximation
Author
Joshi, Shantanu ; Jermaine, Christopher
Author_Institution
Oracle Corp., Redwood
Volume
20
Issue
3
fYear
2008
fDate
3/1/2008 12:00:00 AM
Firstpage
337
Lastpage
351
Abstract
We consider the problem of creating a sample view of a database table. A sample view is an indexed materialized view that permits efficient sampling from an arbitrary range query over the view. Such "sample views" are very useful in applications that require random samples from a database: approximate query processing, online aggregation, data mining, and randomized algorithms are a few examples. Our core technical contribution is a new file organization called the appendability, combinability, and exponentiality (ACE) tree that is suitable for organizing and indexing a sample view. One of the most important aspects of the ACE tree is that it supports online random sampling from the view. That is, at all times, the set of records returned by the ACE tree constitutes a statistically random sample of the database records satisfying the relational selection predicate over the view. Our paper presents experimental results that demonstrate the utility of the ACE tree.
Keywords
database management systems; file organisation; query processing; appendability combinability and exponentiality tree; data mining; database table sample views; file organization; online aggregation; randomized algorithms; Indexing methods; Query processing; Sampling
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
IEEE
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2007.190664
Filename
4358950