• DocumentCode
    3363367
  • Title

    The BofS solution to limitations of approximate summaries

  • Author

    Furtado, Pedro ; Costa, João Furtado

  • Author_Institution
    Dept. Engenharia Informatica, Coimbra Univ., Portugal
  • fYear
    2003
  • fDate
    26-28 March 2003
  • Firstpage
    223
  • Lastpage
    230
  • Abstract
    Data warehouses are of crucial importance to decision-making in competitive organizations. The fact that they store enormous quantities of data is a challenge in what concerns performance and scalability, as users request instant answers. None of the traditional performance strategies is sufficiently good to make complex aggregation queries take only minutes or seconds. The summary warehouse (SW) achieves such a speedup by storing only general-purpose sampling summaries well-fit for aggregated exploration analysis. The major limitation of SWs results from the tradeoff between accuracy and speed: smaller, faster summaries cannot answer less-aggregated queries. In this paper we present the Bag-of-Summaries approach (BofS) designed to deal effectively with this problem: BofS maintains a set of summaries with varied sizes and chooses the right one to answer a query with the desired accuracy and best possible speedup, based on query granularity considerations. We also present experimental results that show the advantage of BofS.
  • Keywords
    data warehouses; query processing; Bag-of-Summaries approach; competitive organizations; complex aggregation queries; data warehouses; decision-making; general-purpose sampling summaries; less-aggregated queries; query granularity considerations; summary warehouse; Aggregates; Data analysis; Data warehouses; Database systems; Decision making; Pattern analysis; Performance analysis; Sampling methods; Scalability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database Systems for Advanced Applications, 2003. (DASFAA 2003). Proceedings. Eighth International Conference on
  • Conference_Location
    Kyoto, Japan
  • Print_ISBN
    0-7695-1895-8
  • Type

    conf

  • DOI
    10.1109/DASFAA.2003.1192386
  • Filename
    1192386