DocumentCode :
2535491
Title :
System-Level, Unified In-band and Out-of-band Dynamic Thermal Control
Author :
Li, Dong ; Ge, Rong ; Cameron, Kirk
fYear :
2010
fDate :
13-16 Sept. 2010
Firstpage :
131
Lastpage :
140
Abstract :
High-density computer racks become increasingly commonplace in supercomputing centers and data centers. With tight integration of high-powered computing components in the racks, hot spots or pockets of elevated temperatures on the chips and system can be easily formed when room air circulation is not effective. Hot spots reduce the reliability of high-density systems and increase the chances of thermal emergencies, which further trigger system slowdowns or shutdowns. Techniques such as dynamically scaling down the voltage of the CPUs and fan control are available on today´s systems to reduce heat generation and dissipate heat. Unfortunately, these techniques work independently on their own without cooperation. As a result, to prevent thermal emergencies, systems may work at reduced capacity when full capacity is required. We propose a combined in-band and out-of-band approach to reduce the likelihood of thermal emergency slowdowns and improve the reliability of systems. Our thermal control framework unifies temperature control mechanisms in systems to balance temperature, power consumption, and performance. More precisely, we balance the use of in-band dynamic voltage and frequency scaling (DVFS) with out-of-band proactive fan control. Our results on a power-aware cluster indicate the coordinated use of fan control and DVFS is more effective than either technique in isolation at reducing average system operating temperatures with expected performance.
Keywords :
computer centres; mainframes; power aware computing; temperature control; DVFS; data centers; fan control; heat dissipation; heat generation; high-density computer racks; high-powered computing components; in-band dynamic voltage and frequency scaling; out-of-band dynamic thermal control; out-of-band proactive fan control; power consumption; power-aware cluster; room air circulation; supercomputing centers; system-level unified in-band thermal control; temperature control mechanisms; Arrays; Frequency control; Servers; Temperature control; Temperature measurement; Temperature sensors; Thermal management; fan control; thermal-aware computing; unified thermal control framework;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Processing (ICPP), 2010 39th International Conference on
Conference_Location :
San Diego, CA
ISSN :
0190-3918
Print_ISBN :
978-1-4244-7913-9
Electronic_ISBN :
0190-3918
Type :
conf
DOI :
10.1109/ICPP.2010.22
Filename :
5599157
Link To Document :
بازگشت