DocumentCode
1040415
Title
Efficient Skyline and Top-k Retrieval in Subspaces
Author
Tao, Yufei ; Xiao, Xiaokui ; Pei, Jian
Author_Institution
Chinese Univ. of Hong Kong, Sha Tin
Volume
19
Issue
8
fYear
2007
Firstpage
1072
Lastpage
1088
Abstract
Skyline and top-k queries are two popular operations for preference retrieval. In practice, applications that require these operations usually provide numerous candidate attributes, whereas, depending on their interests, users may issue queries regarding different subsets of the dimensions. The existing algorithms are inadequate for subspace skyline/top-k search because they have at least one of the following defects: 1) they require scanning the entire database at least once, 2) they are optimized for one subspace but incur significant overhead for other subspaces, or 3) they demand expensive maintenance cost or space consumption. In this paper, we propose a technique SUBSKY, which settles both types of queries by using purely relational technologies. The core of SUBSKY is a transformation that converts multidimensional data to one-dimensional (1D) values. These values are indexed by a simple B-tree, which allows us to answer subspace queries by accessing a fraction of the database. SUBSKY entails low maintenance overhead, which equals the cost of updating a traditional B-tree. Extensive experiments with real data confirm that our technique outperforms alternative solutions significantly in both efficiency and scalability.
Keywords
query processing; relational databases; tree data structures; B-tree; SUBSKY technique; skyline queries; top-k retrieval; Cities and towns; Cost function; Data security; Information retrieval; Multidimensional systems; Relational databases; Scalability; Space technology; B-tree.; Skyline; subspace; top-k;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2007.1051
Filename
4262537
Link To Document