DocumentCode :
2995900
Title :
MapReduce Skyline Query Processing with a New Angular Partitioning Approach
Author :
Chen, Liang ; Hwang, Kai ; Wu, Jian
Author_Institution :
Zhejiang Univ., Hangzhou, China
fYear :
2012
fDate :
21-25 May 2012
Firstpage :
2262
Lastpage :
2270
Abstract :
Fast skyline selection of high-quality web services is of critically importance to upgrade e-commerce and various cloud applications. In this paper, we present a new MapReduce Skyline method for scalable parallel skyline query processing. Our new angular partitioning of the data space reduces the processing time in selecting optimal skyline services. Our method shortens the Reduce time significantly due to the elimination of more redundant dominance computations. Through Hadoop experiments on large server clusters, our method scales well with the increase of both attribute dimensionality and data-space cardinality. We define a new performance metric to assess the local optimality of selected skyline services. By experimenting over 10,000 real-life web service applications over 10 performance attribute dimensions, we find that the angular-partitioned MapReduce method is 1.7 and 2.3 times faster than the dimensional and grid partitioning methods, respectively with a higher probability to reach the local optimality. These results are very encouraging to select optimal web services in real-time out of a large number of web services.
Keywords :
Web services; cloud computing; data handling; electronic commerce; parallel processing; probability; query processing; Hadoop experiment; MapReduce skyline query processing; Web service applications; angular partitioning approach; attribute dimensionality; cloud application; data space angular partitioning; data-space cardinality; e-commerce; fast skyline selection; grid partitioning method; high-quality Web service; local optimality assessment; optimal skyline service; performance attribute dimension; performance metric; probability; processing time reduction; scalable parallel skyline query processing; server cluster; Complexity theory; Partitioning algorithms; Quality of service; Query processing; Servers; Time factors; Web services; Hadoop programming; MapReduce; Web services; skyline query processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2012 IEEE 26th International
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0974-5
Type :
conf
DOI :
10.1109/IPDPSW.2012.279
Filename :
6270590
Link To Document :
بازگشت