• DocumentCode
    3323328
  • Title

    Exploiting Lineage for Confidence Computation in Uncertain and Probabilistic Databases

  • Author

    Sarma, Anish Das ; Theobald, Martin ; Widom, Jennifer

  • Author_Institution
    Stanford Univ., Stanford, CA
  • fYear
    2008
  • fDate
    7-12 April 2008
  • Firstpage
    1023
  • Lastpage
    1032
  • Abstract
    We study the problem of computing query results with confidence values in ULDBs: relational databases with uncertainty and lineage. ULDBs, which subsume probabilistic databases, offer an alternative decoupled method of computing confidence values: Instead of computing confidences during query processing, compute them afterwards based on lineage. This approach enables a wider space of query plans, and it permits selective computations when not all confidence values are needed. This paper develops a suite of algorithms and optimizations for a broad class of relational queries on ULDBs. We provide confidence computation algorithms for single data items, as well as efficient batch algorithms to compute confidences for an entire relation or database. All algorithms incorporate memoization to avoid redundant computations, and they have been implemented in the Trio prototype ULDB database system. Performance characteristics and scalability of the algorithms are demonstrated through experimental results over a large synthetic dataset.
  • Keywords
    probability; query processing; relational databases; Trio prototype database system; batch algorithms; confidence computation algorithms; large synthetic dataset; query processing; relational databases; uncertain-probabilistic databases; Cleaning; Data mining; Database systems; Prototypes; Query processing; Relational databases; Scalability; Space exploration; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
  • Conference_Location
    Cancun
  • Print_ISBN
    978-1-4244-1836-7
  • Electronic_ISBN
    978-1-4244-1837-4
  • Type

    conf

  • DOI
    10.1109/ICDE.2008.4497511
  • Filename
    4497511