• DocumentCode
    945828
  • Title

    Materialized Sample Views for Database Approximation

  • Author

    Joshi, Shantanu ; Jermaine, Christopher

  • Author_Institution
    Oracle Corp., Redwood
  • Volume
    20
  • Issue
    3
  • fYear
    2008
  • fDate
    3/1/2008 12:00:00 AM
  • Firstpage
    337
  • Lastpage
    351
  • Abstract
    We consider the problem of creating a sample view of a database table. A sample view is an indexed materialized view that permits efficient sampling from an arbitrary range query over the view. Such "sample views" are very useful in applications that require random samples from a database: approximate query processing, online aggregation, data mining, and randomized algorithms are a few examples. Our core technical contribution is a new file organization called the appendability, combinability, and exponentiality (ACE) tree that is suitable for organizing and indexing a sample view. One of the most important aspects of the ACE tree is that it supports online random sampling from the view. That is, at all times, the set of records returned by the ACE tree constitutes a statistically random sample of the database records satisfying the relational selection predicate over the view. Our paper presents experimental results that demonstrate the utility of the ACE tree.
  • Keywords
    database management systems; file organisation; query processing; appendability combinability and exponentiality tree; data mining; database table sample views; file organization; online aggregation; query processing; randomized algorithms; Indexing methods; Query processing; Sampling;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2007.190664
  • Filename
    4358950