DocumentCode
3144740
Title
Providing support for full relational algebra in probabilistic databases
Author
Fink, Robert ; Olteanu, Dan ; Rath, Swaroop
Author_Institution
Oxford Univ. Comput. Lab., Oxford, UK
fYear
2011
fDate
11-16 April 2011
Firstpage
315
Lastpage
326
Abstract
Extensive work has recently been done on the evaluation of positive queries on probabilistic databases. The case of queries with negation has notoriously been left out, since it raises serious additional challenges to efficient query evaluation. This paper provides a complete framework for the evaluation of full relational algebra queries in probabilistic databases. In particular, it proposes exact and approximate evaluation techniques for relational algebra queries on representation systems that can accommodate any finite probability space over relational databases. Key ingredients to these techniques are (1) the manipulation of nested propositional expressions used for probability computation without unfolding them into disjunctive normal form, and (2) efficient computation of lower and upper probability bounds of such expressions by deriving coarser expressions in tractable theories such as one occurrence form. We complement our evaluation techniques with a tractability map for relational algebra queries without repeating relation symbols and for quantified queries such as set inclusion, equality, incomparability, and relational division, which are expressible in relational algebra using nested negation and repeating relation symbols. Based on this tractability study, we syntactically define a practical class of tractable relational algebra queries. We incorporated this framework in the SPROUT engine and show its efficiency experimentally in TPC-H and RFID scenarios.
Keywords
probability; query processing; relational algebra; relational databases; SPROUT engine; approximate evaluation techniques; exact evaluation techniques; finite probability space; full relational algebra queries; positive query evaluation; probabilistic databases; relational databases; representation systems; Algebra; Approximation methods; Cost accounting; Probabilistic logic; Query processing; Random variables;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering (ICDE), 2011 IEEE 27th International Conference on
Conference_Location
Hannover
ISSN
1063-6382
Print_ISBN
978-1-4244-8959-6
Electronic_ISBN
1063-6382
Type
conf
DOI
10.1109/ICDE.2011.5767912
Filename
5767912
Link To Document