DocumentCode
2786072
Title
Synchronous Parallel Processing of Big-Data Analytics Services to Optimize Performance in Federated Clouds
Author
Jung, Gueyoung ; Gnanasambandam, Nathan ; Mukherjee, Tridib
Author_Institution
Xerox Res. Center Webster, Webster, MA, USA
fYear
2012
fDate
24-29 June 2012
Firstpage
811
Lastpage
818
Abstract
Parallelization of big-data analytics services over a federation of heterogeneous clouds has been considered to improve performance. However, contrary to common intuition, there is an inherent tradeoff between the level of parallelism and the performance for big-data analytics principally because of a significant delay for big-data to get transferred over the network. The data transfer delay can be comparable or even higher than the time required to compute data. To address the aforementioned tradeoff, this paper determines: (a) how many and which computing nodes in federated clouds should be used for parallel execution of big-data analytics; (b) opportunistic apportioning of big-data to these computing nodes in a way to enable synchronized completion at best-effort performance; and (c) sequence of apportioned, different sizes of big-data chunks to be computed in each node so that transfer of a chunk is overlapped as much as possible with the computation of the previous chunk in the node. In this regard, Maximally Overlapped Bin-packing driven Bursting (MOBB) algorithm is proposed, which improve the performance by up to 60% against existing approaches.
Keywords
bin packing; cloud computing; data analysis; software performance evaluation; MOBB algorithm; big-data analytics services; big-data chunks; computing nodes; data transfer delay; federated clouds; heterogeneous clouds; maximally overlapped bin-packing driven bursting algorithm; opportunistic apportioning; parallel execution; performance improvement; performance optimization; synchronous parallel processing; Computational modeling; Data mining; Delay; Estimation; Parallel processing; Sorting; Synchronization; big-data analytics; federated clouds; parallelization;
fLanguage
English
Publisher
ieee
Conference_Titel
Cloud Computing (CLOUD), 2012 IEEE 5th International Conference on
Conference_Location
Honolulu, HI
ISSN
2159-6182
Print_ISBN
978-1-4673-2892-0
Type
conf
DOI
10.1109/CLOUD.2012.108
Filename
6253583
Link To Document