• DocumentCode
    3144740
  • Title

    Providing support for full relational algebra in probabilistic databases

  • Author

    Fink, Robert ; Olteanu, Dan ; Rath, Swaroop

  • Author_Institution
    Oxford Univ. Comput. Lab., Oxford, UK
  • fYear
    2011
  • fDate
    11-16 April 2011
  • Firstpage
    315
  • Lastpage
    326
  • Abstract
    Extensive work has recently been done on the evaluation of positive queries on probabilistic databases. The case of queries with negation has notoriously been left out, since it raises serious additional challenges to efficient query evaluation. This paper provides a complete framework for the evaluation of full relational algebra queries in probabilistic databases. In particular, it proposes exact and approximate evaluation techniques for relational algebra queries on representation systems that can accommodate any finite probability space over relational databases. Key ingredients to these techniques are (1) the manipulation of nested propositional expressions used for probability computation without unfolding them into disjunctive normal form, and (2) efficient computation of lower and upper probability bounds of such expressions by deriving coarser expressions in tractable theories such as one occurrence form. We complement our evaluation techniques with a tractability map for relational algebra queries without repeating relation symbols and for quantified queries such as set inclusion, equality, incomparability, and relational division, which are expressible in relational algebra using nested negation and repeating relation symbols. Based on this tractability study, we syntactically define a practical class of tractable relational algebra queries. We incorporated this framework in the SPROUT engine and show its efficiency experimentally in TPC-H and RFID scenarios.
  • Keywords
    probability; query processing; relational algebra; relational databases; SPROUT engine; approximate evaluation techniques; exact evaluation techniques; finite probability space; full relational algebra queries; positive query evaluation; probabilistic databases; relational databases; representation systems; Algebra; Approximation methods; Cost accounting; Probabilistic logic; Query processing; Random variables;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2011 IEEE 27th International Conference on
  • Conference_Location
    Hannover
  • ISSN
    1063-6382
  • Print_ISBN
    978-1-4244-8959-6
  • Electronic_ISBN
    1063-6382
  • Type

    conf

  • DOI
    10.1109/ICDE.2011.5767912
  • Filename
    5767912