• DocumentCode
    168658
  • Title

    Link-Heterogeneous Work Stealing

  • Author

    Trong-Tuan Vu ; Derbel, Bilel

  • Author_Institution
    INRIA Lille Nord Eur., Lille, France
  • fYear
    2014
  • fDate
    26-29 May 2014
  • Firstpage
    354
  • Lastpage
    363
  • Abstract
    Random work-stealing has been proved to be extremely beneficial in dynamically load-balancing irregular applications. However, it is known to perform loosely in non-homogenous distributed systems where communications costs are a major obstacle for high performance. In this paper, we investigate the design of an effective work-stealing protocol dealing with the heterogeneity of network link latencies. We propose a generic distributed algorithm which can be easily implemented to fit different types of heterogeneity. The proposed algorithm extends on reference approaches, namely Probabilistic Work Stealing (PWS), and Adaptive Cluster-aware Random Stealing (ACRS), by introducing new adaptive control operations that are shown to be highly accurate in increasing work locality and decreasing steals cost. We provide a comprehensive analysis including: (i) a comparative study on a broad range of harsh network scenarios, and (ii) an in-depth analysis of protocols´ behavior at the aim of gaining new insights into dynamic load-balancing in heterogeneous distributed environments. Over all experimented configurations, our results show that although the proposed protocol is not tailored for a specific networked platform, it can save 30% execution time in average compared to its competitors, while demonstrating high quality self-adjusting capabilities.
  • Keywords
    adaptive control; distributed algorithms; resource allocation; ACRS approach; PWS approach; adaptive cluster-aware random stealing approach; adaptive control operations; dynamically load-balancing irregular applications; generic distributed algorithm; link-heterogeneous work stealing; network link latencies; nonhomogenous distributed systems; probabilistic work stealing approach; random work-stealing; work locality; work-stealing protocol; Clustering algorithms; Computational modeling; Context; Load modeling; Peer-to-peer computing; Probabilistic logic; Protocols; B&B; Load-balancing; UTS; work-stealing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster, Cloud and Grid Computing (CCGrid), 2014 14th IEEE/ACM International Symposium on
  • Conference_Location
    Chicago, IL
  • Type

    conf

  • DOI
    10.1109/CCGrid.2014.85
  • Filename
    6846471