DocumentCode
580069
Title
Scalable work stealing
Author
Dinan, James ; Larkins, D.B. ; Sadayappan, P. ; Krishnamoorthy, Sriram ; Nieplocha, Jarek
Author_Institution
Dept. Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
fYear
2009
fDate
14-20 Nov. 2009
Firstpage
1
Lastpage
11
Abstract
Irregular and dynamic parallel applications pose significant challenges to achieving scalable performance on large-scale multicore clusters. These applications often require ongoing, dynamic load balancing in order to maintain efficiency. Scalable dynamic load balancing on large clusters is a challenging problem which can be addressed with distributed dynamic load balancing systems. Work stealing is a popular approach to distributed dynamic load balancing; however its performance on large-scale clusters is not well understood. Prior work on work stealing has largely focused on shared memory machines. In this work we investigate the design and scalability of work stealing on modern distributed memory systems. We demonstrate high efficiency and low overhead when scaling to 8,192 processors for three benchmark codes: a producer-consumer benchmark, the unbalanced tree search benchmark, and a multiresolution analysis kernel.
Keywords
distributed shared memory systems; parallel processing; resource allocation; tree searching; distributed dynamic load balancing systems; distributed memory systems; dynamic parallel applications; irregular parallel applications; large-scale multicore clusters; multiresolution analysis kernel; producer-consumer benchmark; scalable dynamic load balancing; scalable work stealing; shared memory machines; unbalanced tree search benchmark; ARMCI; PGAS; dynamic load balancing; global arrays; task pools; work stealing;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing Networking, Storage and Analysis, Proceedings of the Conference on
Conference_Location
Portland, OR
Type
conf
DOI
10.1145/1654059.1654113
Filename
6375517
Link To Document