DocumentCode
633074
Title
Efficient Probabilistic Skyline Query Processing in MapReduce
Author
Linlin Ding ; Guoren Wang ; Junchang Xin ; Ye Yuan
Author_Institution
Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang, China
fYear
2013
fDate
June 27 2013-July 2 2013
Firstpage
203
Lastpage
210
Abstract
As a popular parallel programming model, how to process probabilistic skyline query over uncertain data in MapReduce framework is becoming an urgent problem to be resolved. In MapReduce framework, implementing probabilistic skyline query is nontrivial since the probabilistic skyline query is not decomposable. Therefore, in this paper, we propose a filter-refine two phases approach in MapReduce that translates the probabilistic skyline query into two decomposable computations for obtaining the final results. Firstly, we describe the whole processing procedure of filter-refine, and then propose an efficient probabilistic skyline query processing algorithm in MapReduce. Furthermore, to reduce the computation and communication cost, we develop the optimized probabilistic skyline query processing algorithm to prune the unpromising data both in filter and refine phases. Finally, we conduct extensive experiments on synthetic data to verify the effectiveness and efficiency of the proposed filter-refine approach with various experimental settings.
Keywords
parallel programming; probability; query processing; MapReduce framework; filter-refine processing procedure; parallel programming model; probabilistic skyline query processing; Data models; Educational institutions; Probabilistic logic; Probability; Query processing; Silicon; MapReduce; probabilistic skyline; uncertain data;
fLanguage
English
Publisher
ieee
Conference_Titel
Big Data (BigData Congress), 2013 IEEE International Congress on
Conference_Location
Santa Clara, CA
Print_ISBN
978-0-7695-5006-0
Type
conf
DOI
10.1109/BigData.Congress.2013.35
Filename
6597138
Link To Document