DocumentCode :
2492615
Title :
Recording How-Provenance on Probabilistic Databases
Author :
Gao, Ming ; He, Xiangnan ; Jin, Cheqing ; Wang, XiaoLing ; Zhou, Aoying
Author_Institution :
Shanghai Key Lab. of Intell. Inf. Process., Fudan Univ., Shanghai, China
fYear :
2010
fDate :
6-8 April 2010
Firstpage :
205
Lastpage :
211
Abstract :
Tracking data provenance (or lineage) has become increasingly important in many large-scale applications, and a few methods for recording data provenance have been proposed recently. However, most previous work focuses on deterministic databases; the exception is Trio-style lineage, which targets probabilistic databases, a much more challenging setting because of the exponential growth of possible-world instances and the dependence among intermediate tuples. This paper proposes an approach, named PHP-tree, to model how-provenance on probabilistic databases. We also show how to evaluate probability based on a PHP-tree. Compared with Trio-style lineage, our approach is independent of intermediate results and can calculate the probability under both restricted and complete propagation of data provenance. Detailed experimental results show the effectiveness, efficiency, and scalability of the proposed model.
Keywords :
data warehouses; probability; PHP-tree; data provenance; deterministic databases; large scale applications; probabilistic databases; recording how provenance; trio style lineage; Color; Deductive databases; Helium; Information processing; Laboratories; Large-scale systems; Probability; Relational databases; Software engineering; Uncertainty; How Provenance; Probability evaluation; probabilistic databases;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Web Conference (APWEB), 2010 12th International Asia-Pacific
Conference_Location :
Busan
Print_ISBN :
978-0-7695-4012-2
Electronic_ISBN :
978-1-4244-6600-9
Type :
conf
DOI :
10.1109/APWeb.2010.19
Filename :
5474134