• DocumentCode
    1317244
  • Title

    Making Aggregation Work in Uncertain and Probabilistic Databases

  • Author

    Murthy, Raghotham ; Ikeda, Robert ; Widom, Jennifer

  • Author_Institution
    Dept. of Comput. Sci., Stanford Univ., Stanford, CA, USA
  • Volume
    23
  • Issue
    8
  • fYear
    2011
  • Firstpage
    1261
  • Lastpage
    1273
  • Abstract
    We describe how aggregation is handled in the Trio system for uncertain and probabilistic data. Because “exact” aggregation in uncertain databases can produce exponentially sized results, we provide three alternatives: a low bound on the aggregate value, a high bound on the value, and the expected value. These variants return a single result instead of a set of possible results, and they are generally efficient to compute for both full-table and grouped aggregation queries. We provide formal definitions and semantics and a description of our open source implementation for single-table aggregation queries. We study the performance and scalability of our algorithms through experiments over a large synthetic data set. We also provide some preliminary results on aggregations over joins.
  • Keywords
    query processing; statistical databases; Trio system; aggregation work; full-table aggregation queries; grouped aggregation queries; probabilistic databases; uncertain databases; Aggregates; Data models; Databases; Image color analysis; Probabilistic logic; Semantics; Uncertainty; Database management; query processing.;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2010.166
  • Filename
    5567105