DocumentCode :
656136
Title :
Inspector-Executor Load Balancing Algorithms for Block-Sparse Tensor Contractions
Author :
Ozog, David ; Hammond, Jeff R. ; Dinan, James ; Balaji, Pavan ; Shende, Sameer ; Malony, Allen
Author_Institution :
Univ. of Oregon, Eugene, OR, USA
fYear :
2013
fDate :
1-4 Oct. 2013
Firstpage :
30
Lastpage :
39
Abstract :
Developing effective yet scalable load-balancing methods for irregular computations is critical to the successful application of simulations in a variety of disciplines at petascale and beyond. This paper explores a set of static and dynamic scheduling algorithms for block-sparse tensor contractions within the NWChem computational chemistry code for different degrees of sparsity (and therefore load imbalance). In this particular application, a relatively large amount of task information can be obtained at minimal cost, which enables the use of static partitioning techniques that take the entire task list as input. However, fully static partitioning is incapable of dealing with dynamic variation of task costs, such as from transient network contention or operating system noise, so we also consider hybrid schemes that utilize dynamic scheduling within subgroups. These two schemes, which have not been previously implemented in NWChem or its proxies (i.e. quantum chemistry mini-apps) are compared to the original centralized dynamic load-balancing algorithm as well as improved centralized scheme. In all cases, we separate the scheduling of tasks from the execution of tasks into an inspector phase and an executor phase. The impact of these methods upon the application is substantial on a large InfiniBand cluster: execution time is reduced by as much as 50% at scale. The technique is applicable to any scientific application requiring load balance where performance models or estimations of kernel execution times are available.
Keywords :
chemistry computing; resource allocation; scheduling; InfiniBand cluster; NWChem computational chemistry code; block-sparse tensor contractions; dynamic scheduling algorithms; inspector-executor load balancing algorithms; kernel execution times; operating system noise; sparsity degree; static partitioning techniques; static scheduling algorithms; task scheduling; transient network contention; Computational efficiency; Computational modeling; Load management; Load modeling; Radiation detectors; Tensile stress; Tiles; Dynamic Load Balancing; Global Arrays; Quantum Chemistry; Static Partitioning; Tensor Contractions;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Processing (ICPP), 2013 42nd International Conference on
Conference_Location :
Lyon
ISSN :
0190-3918
Type :
conf
DOI :
10.1109/ICPP.2013.12
Filename :
6687336
Link To Document :
بازگشت