DocumentCode :
1484861
Title :
Efficient and Progressive Algorithms for Distributed Skyline Queries over Uncertain Data
Author :
Ding, Xiaofeng ; Jin, Hai
Author_Institution :
Sch. of Comput. Sci. & Technol., Huazhong Univ. of Sci. & Technol., Wuhan, China
Volume :
24
Issue :
8
fYear :
2012
Firstpage :
1448
Lastpage :
1462
Abstract :
The skyline operator has received considerable attention from the database community, due to its importance in many applications including multicriteria decision making, preference answering, and so forth. In many applications where uncertain data are inherently exist, i.e., data collected from different sources in distributed locations are usually with imprecise measurements, and thus exhibit kind of uncertainty. Taking into account the network delay and economic cost associated with sharing and communicating large amounts of distributed data over an internet, an important problem in this scenario is to retrieve the global skyline tuples from all the distributed local sites with minimum communication cost. Based on the well-known notation of the probabilistic skyline query over centralized uncertain data, in this paper, we propose the notation of distributed skyline queries over uncertain data. Furthermore, two communication- and computation-efficient algorithms are proposed to retrieve the qualified skylines from distributed local sites. Extensive experiments have been conducted to verify the efficiency, the effectiveness and the progressiveness of our algorithms with both the synthetic and real data sets.
Keywords :
Internet; decision making; distributed databases; probability; query processing; Internet; centralized uncertain data; communication-efficient algorithm; computation-efficient algorithm; distributed data sharing; distributed local sites; distributed skyline queries; economic cost; global skyline tuples; minimum communication cost; multicriteria decision making; network delay; preference answering; probabilistic skyline query; skyline operator; Algorithm design and analysis; Bandwidth; Distributed databases; Probabilistic logic; Servers; Uncertainty; Skyline; distributed database; uncertain data.;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2011.77
Filename :
5740889
Link To Document :
بازگشت