Title :
Using Machine Affinity to Increase Science Throughput (Machine Affinity Characterization of the HPCMP Workload)
Author :
Snavely, A. ; Gamst, Anthony ; Carrington, Laura ; Tikir, M. ; Laurenzano, Michael
Author_Institution :
San Diego Supercomput. Center, Univ. of California, San Diego, CA, USA
Abstract :
Machine affinity is the observed phenomena that some applications benefit more than others from features of high performance computing (HPC) architectures. When considering a diverse portfolio of HPC machines manufactured by different vendors and of different ages, such as the set of all supercomputers currently operated by the Department of Defense High Performance Computing Modernization Program, it should be obvious that some run a given application faster than others do. Therefore, almost every user would request to run on the fastest machines. But an important insight is that some applications benefit more from the features of the faster machines than others do. If allocations are done in such a way that applications that benefit the most from the features of the fastest machines are assigned to those machines then overall throughput across all machines is boosted by more than 10%. We exhibit exemplary empirical analysis and provide a simple algorithm for doing allocations based on machine affinity. The net effect is like adding a new $10M supercomputer to the portfolio without paying for it.
Keywords :
military computing; multiprocessing systems; parallel machines; Department of Defense High Performance Computing Modernization Program; HPCMP workload; machine affinity characterization; science throughput; Mathematical model; Portfolios; Program processors; Proposals; Resource management; Runtime; Throughput; affinity; job scheduling; performance modeling; supercomputing;
Conference_Titel :
High Performance Computing Modernization Program Users Group Conference (HPCMP-UGC), 2010 DoD
Conference_Location :
Schaumburg, IL
Print_ISBN :
978-1-61284-986-7
DOI :
10.1109/HPCMP-UGC.2010.53