Title :
Probabilistic Ranking Queries on Gaussians
Author :
Böhm, Christian ; Pryakhin, Alexey ; Schubert, Matthias
Author_Institution :
Inst. for Informatics, Univ. of Munich
Abstract :
In many modern applications, there are no exact values available to describe the data objects. Instead, the feature values are considered to be uncertain. This uncertainty is modeled by probability distributions instead of exact feature values. A typical application of such an uncertainty model are moving objects where the exact position of each object can be determined only at discrete time intervals. Queries often involve the positions of objects between two such time stamps or after the last known time stamp. Then the objects are essentially uncertain unless the pattern of movement is very simple (e.g. linear). One of the most important probability density functions for those applications is the Gaussian or normal distribution which can be defined by a mean value and a standard deviation. In this paper, we examine a new type of queries on uncertain data objects, called probability ranking queries (PRQ). A PRQ retrieves those k objects which have the highest probability of being located inside a given query area. To speed up probabilistic queries on large sets of uncertain data objects described by Gaussians, we introduce a novel index structure called Gauss-tree. Furthermore, we provide an algorithm for employing the Gauss-tree to answer PRQs. In our experimental evaluation, we demonstrate that the Gauss-tree achieves a considerable efficiency advantage with respect to PRQs compared to other applicable methods
Keywords :
Gaussian distribution; database indexing; normal distribution; query processing; tree data structures; Gauss-tree; Gaussian distribution; index structure; normal distribution; probabilistic ranking query; probability density function; probability distribution; standard deviation; uncertain data object; Distribution functions; Gas detectors; Gaussian distribution; Gaussian processes; Informatics; Probability density function; Probability distribution; Spatial databases; Temperature sensors; Uncertainty;
Conference_Titel :
Scientific and Statistical Database Management, 2006. 18th International Conference on
Conference_Location :
Vienna
Print_ISBN :
0-7695-2590-3
DOI :
10.1109/SSDBM.2006.40