Title :
An Efficient Algorithm for top-k Queries on Uncertain Data Streams
Author :
Caiyan Dai ; Ling Chen ; Yixin Chen ; Keming Tang
Author_Institution :
Coll. of Comput. Sci. & Technol., Nanjing Univ. of Aeronaut. & Astronaut., Nanjing, China
Abstract :
We tackle the problem of answering maximum probabilistic top-k tuple set queries. We use a sliding-window model on uncertain data streams and present an efficient algorithm for processing sliding-window queries on uncertain streams. In each sliding window, the algorithm selects the k tuples with the highest probabilities from sets of different numbers of the tuples with the highest scores. Then, the algorithm computes existential probability of the top-k tuples, and chooses the set with the highest probability as the top-k query result. We theoretically prove the correctness of the algorithm. Our experimental results show that our algorithm requires lower time and space complexity than other existing algorithms.
Keywords :
computational complexity; data mining; query processing; set theory; maximum probabilistic top-k tuple set query answering; sliding-window model; sliding-window query processing; space complexity; time complexity; top-k query algorithm; uncertain data streams; Algorithm design and analysis; Complexity theory; Databases; Educational institutions; Heuristic algorithms; Probability; Software algorithms; slidingwindow; top-k queries; uncertain data streams;
Conference_Titel :
Machine Learning and Applications (ICMLA), 2012 11th International Conference on
Conference_Location :
Boca Raton, FL
Print_ISBN :
978-1-4673-4651-1
DOI :
10.1109/ICMLA.2012.57