Title :
Link-Heterogeneous Work Stealing
Author :
Trong-Tuan Vu ; Derbel, Bilel
Author_Institution :
INRIA Lille Nord Eur., Lille, France
Abstract :
Random work-stealing has proven extremely beneficial for dynamically load-balancing irregular applications. However, it is known to perform poorly in non-homogeneous distributed systems, where communication costs are a major obstacle to high performance. In this paper, we investigate the design of an effective work-stealing protocol that deals with the heterogeneity of network link latencies. We propose a generic distributed algorithm which can be easily implemented to fit different types of heterogeneity. The proposed algorithm extends reference approaches, namely Probabilistic Work Stealing (PWS) and Adaptive Cluster-aware Random Stealing (ACRS), by introducing new adaptive control operations that are shown to be highly effective at increasing work locality and decreasing the cost of steals. We provide a comprehensive analysis including: (i) a comparative study on a broad range of harsh network scenarios, and (ii) an in-depth analysis of the protocols' behavior, with the aim of gaining new insights into dynamic load-balancing in heterogeneous distributed environments. Across all tested configurations, our results show that although the proposed protocol is not tailored to a specific networked platform, it can reduce execution time by 30% on average compared to its competitors, while demonstrating high-quality self-adjusting capabilities.
Keywords :
adaptive control; distributed algorithms; resource allocation; ACRS approach; PWS approach; adaptive cluster-aware random stealing approach; adaptive control operations; dynamically load-balancing irregular applications; generic distributed algorithm; link-heterogeneous work stealing; network link latencies; nonhomogenous distributed systems; probabilistic work stealing approach; random work-stealing; work locality; work-stealing protocol; Clustering algorithms; Computational modeling; Context; Load modeling; Peer-to-peer computing; Probabilistic logic; Protocols; B&B; Load-balancing; UTS; work-stealing;
Conference_Title :
Cluster, Cloud and Grid Computing (CCGrid), 2014 14th IEEE/ACM International Symposium on
Conference_Location :
Chicago, IL
DOI :
10.1109/CCGrid.2014.85