Title :
Fighting fire with fire: Modeling the datacenter-scale effects of targeted superlattice thermal management
Author :
Biswas, Susmit ; Tiwari, Mohit ; Sherwood, Timothy ; Theogarajan, Luke ; Chong, Frederic T.
Author_Institution :
Lawrence Livermore Nat. Lab., Lawrence, CA, USA
Abstract :
Local thermal hot-spots in microprocessors lead to worst-case provisioning of global cooling resources, especially in large-scale systems where cooling power can be 50~100% of IT power. Further, the efficiency of cooling solutions degrade non-linearly with supply temperature. Recent advances in active cooling techniques have shown on-chip thermoelectric coolers (TECs) to be very efficient at selectively eliminating small hot-spots. Applying current to a superlattice TEC-film that is deposited between silicon and the heat spreader results in a Peltier effect, which spreads the heat and lowers the temperature of the hot-spot significantly and improves chip reliability. In this paper, we propose that hot-spot mitigation using thermoelectric coolers can be used as a power management mechanism to allow global coolers to be provisioned for a better worst case temperature leading to substantial savings in cooling power. In order to quantify the potential power savings from using TECs in data center servers, we present a detailed power model that integrates on-chip dynamic and leakage power sources, heat diffusion through the entire chip, TEC and global cooler efficiencies, and all their mutual interactions. Our multi-scale analysis shows that, for a typical data center, TECs allow global coolers to operate at higher temperatures without degrading chip lifetime, and thus save ~27% cooling power on average while providing the same processor reliability as a data center running at 288K.
Keywords :
computer centres; integrated circuit reliability; microprocessor chips; power aware computing; thermal management (packaging); thermoelectric cooling; Peltier effect; chip reliability; cooling power; cooling solutions; datacenter-scale effects; global cooling resources; heat diffusion; heat spreader; hot-spot mitigation; large-scale systems; leakage power sources; local thermal hot-spots; microprocessors; on-chip dynamic; on-chip thermoelectric coolers; silicon; superlattice TEC-film; targeted superlattice thermal management; Atmospheric modeling; Heat sinks; Heating; Mathematical model; Servers; Temperature distribution;
Conference_Titel :
Computer Architecture (ISCA), 2011 38th Annual International Symposium on
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4503-0472-6