DocumentCode
560191
Title
A ‘cool’ load balancer for parallel applications
Author
Sarood, Osman ; Kale, Laxmikant V.
Author_Institution
Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
fYear
2011
fDate
12-18 Nov. 2011
Firstpage
1
Lastpage
11
Abstract
Meeting power requirements of huge exascale machines of the future will be a major challenge. Our focus in this paper is to minimize cooling power and we propose a technique that uses a combination of DVFS and temperature aware load balancing to constrain core temperatures as well as save cooling energy. Our scheme is specifically designed to suit parallel applications which are typically tightly coupled. The temperature control, comes at the cost of execution time and we try to minimize the timing penalty. We experiment with three applications (with different power utilization profiles), run on a 128-core (32-node) cluster with a dedicated air conditioning unit. We calibrate the efficacy of our scheme based on three metrics: ability to control average core temperatures thereby avoiding hot spot occurrence, timing penalty minimization, and cooling energy savings. Our results show cooling energy savings of up to 57% with a timing penalty of 19%.
Keywords
air conditioning; computer centres; cooling; multiprocessing systems; parallel processing; resource allocation; temperature control; 128-core cluster; 32-node cluster; DVFS; average core temperature control; cooling energy savings; cooling power minimization; core temperatures; dedicated air conditioning unit; exascale machines; hot spot occurrence; load balancer; parallel applications; power requirements; power utilization profiles; temperature aware load balancing; timing penalty minimization; Cooling; Heating; Load management; Temperature measurement; Time frequency analysis; Timing;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis (SC), 2011 International Conference for
Conference_Location
Seatle, WA
Electronic_ISBN
978-1-4503-0771-0
Type
conf
Filename
6114458
Link To Document