DocumentCode
3190226
Title
Representing Tuple and Attribute Uncertainty in Probabilistic Databases
Author
Sen, Prithviraj ; Deshpande, Amol ; Getoor, Lise
Author_Institution
Univ. of Maryland, College Park
fYear
2007
fDate
28-31 Oct. 2007
Firstpage
507
Lastpage
512
Abstract
There has been a recent surge in work in probabilistic databases, propelled in large part by the huge increase in noisy data sources-sensor data, experimental data, data from uncurated sources, and many others. There is a growing need to be able to flexibly represent the uncertainties in the data, and to efficiently query the data. Building on existing probabilistic database work, we present a unifying framework which allows a flexible representation of correlated tuple and attribute level uncertainties. An important capability of our representation is the ability to represent shared correlation structures in the data. We provide motivating examples to illustrate when such shared correlation structures are likely to exist. Representing shared correlations structures allows the use of sophisticated inference techniques based on lifted probabilistic inference that, in turn, allows us to achieve significant speedups while computing probabilities for results of user-submitted queries.
Keywords
database management systems; inference mechanisms; uncertainty handling; attribute uncertainty; correlation structures; noisy data sources; probabilistic database; probabilistic inference; tuple representation; Computer science; Conferences; Data mining; Data models; Educational institutions; Probability distribution; Query processing; Relational databases; Sensor phenomena and characterization; Uncertainty;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshops, 2007. ICDM Workshops 2007. Seventh IEEE International Conference on
Conference_Location
Omaha, NE
Print_ISBN
978-0-7695-3019-2
Electronic_ISBN
978-0-7695-3033-8
Type
conf
DOI
10.1109/ICDMW.2007.11
Filename
4476715
Link To Document