DocumentCode
2720614
Title
Scalable parallelization of skyline computation for multi-core processors
Author
Chester, Sean ; Sidlauskas, Darius ; Assent, Ira ; Bogh, Kenneth S.
Author_Institution
Data-Intensive Syst. Group, Aarhus Univ., Aarhus, Denmark
fYear
2015
fDate
13-17 April 2015
Firstpage
1083
Lastpage
1094
Abstract
The skyline is an important query operator for multi-criteria decision making. It reduces a dataset to only those points that offer optimal trade-offs of dimensions. In general, it is very expensive to compute. Recently, multicore CPU algorithms have been proposed to accelerate the computation of the skyline. However, they do not sufficiently minimize dominance tests and so are not competitive with state-of-the-art sequential algorithms. In this paper, we introduce a novel multicore skyline algorithm, Hybrid, which processes points in blocks. It maintains a shared, global skyline among all threads, which is used to minimize dominance tests while maintaining high throughput. The algorithm uses an efficiently-updatable data structure over the shared, global skyline, based on point-based partitioning. Also, we release a large benchmark of optimized skyline algorithms, with which we demonstrate on challenging workloads a 100-fold speedup over state-of-the-art multicore algorithms and a 10-fold speedup with 16 cores over state-of-the-art sequential algorithms.
Keywords
multi-threading; multiprocessing systems; query processing; data structure; dominance test minimization; hybrid algorithm; multicore CPU algorithms; multicore processors; multicriteria decision making; optimized skyline algorithms; point-based partitioning; query operator; scalable parallelization; shared-global skyline; skyline computation; throughput; Data structures; Fuels; Parallel processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering (ICDE), 2015 IEEE 31st International Conference on
Conference_Location
Seoul
Type
conf
DOI
10.1109/ICDE.2015.7113358
Filename
7113358
Link To Document