Title :
Making Aggregation Work in Uncertain and Probabilistic Databases
Author :
Murthy, Raghotham ; Ikeda, Robert ; Widom, Jennifer
Author_Institution :
Dept. of Comput. Sci., Stanford Univ., Stanford, CA, USA
Abstract :
We describe how aggregation is handled in the Trio system for uncertain and probabilistic data. Because “exact” aggregation in uncertain databases can produce exponentially sized results, we provide three alternatives: a low bound on the aggregate value, a high bound on the value, and the expected value. These variants return a single result instead of a set of possible results, and they are generally efficient to compute for both full-table and grouped aggregation queries. We provide formal definitions and semantics and a description of our open source implementation for single-table aggregation queries. We study the performance and scalability of our algorithms through experiments over a large synthetic data set. We also provide some preliminary results on aggregations over joins.
Keywords :
query processing; statistical databases; Trio system; aggregation work; full-table aggregation queries; grouped aggregation queries; probabilistic databases; uncertain databases; Aggregates; Data models; Databases; Image color analysis; Probabilistic logic; Semantics; Uncertainty; Database management; query processing.;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2010.166