Title :
Thermal and mechanical analysis and design of the IBM Power 775 water cooled supercomputing central electronics complex
Author :
Goth, Gary F. ; Arvelo, Amilcar ; Eagle, Jason ; Ellsworth, Michael J., Jr. ; Marston, Kenneth C. ; Sinha, Arvind K. ; Zitz, Jeffrey A.
Author_Institution :
IBM Corp., Poughkeepsie, NY, USA
fDate :
May 30 2012-June 1 2012
Abstract :
Back in 2008 IBM reintroduced water cooling technology into its high performance computing platform, the Power 575 Supercomputing node/system. Water cooled cold plates were used to cool the processor modules which represented about half of the total system (rack) heat load. An air-to-liquid heat exchanger was also mounted in the rear door of the rack to remove a significant fraction of the other half of the rack heat load; the heat load to air. Water cooling enabled a compute node with 34% greater performance (Flops), resulted in a processor temperature 20-30°C lower than that typically provided with air cooling, and reduced the power consumed in the data center to transfer the IT heat to the outside ambient by as much as 45%. The next generation of this platform, the Power 775 Supercomputing node/system, is a significant leap forward in computing performance and energy efficiency. The compute node and system were designed from the start with water cooling in mind. The result, a system with greater than 95% of its heat load conducted directly to water; a system that, together with a rear door heat exchanger, removes 100% of its heat load to water with no requirement for room air conditioning. In addition to the processor, memory, power conversion, and I/O electronics conduct their heat to water. Included within the framework of the system is a disk storage unit (disc enclosure) containing an inter-board air-to-water heat exchanger. This paper will detail key thermal and mechanical design issues associated with the Power 775 server drawer or central electronics complex (CEC). Topics to be addressed include processor and optical I/O Hub Module thermal design (including thermal interfaces); water cooled memory design; module cold plate designs; CEC level water distribution; module level structural analyses for thermal performance; module/board land grid array (LGA) load distribution; effect of load distribution on module thermal interfaces; and the effect of c- ld plate tubing on module (LGA) loading.
Keywords :
ball grid arrays; computer centres; cooling; heat exchangers; mainframes; thermal analysis; CEC level water distribution; I/O electronics; IBM power 775 water cooled supercomputing central electronic complex design; LGA load distribution effect; air cooling; air-to-liquid heat exchanger; cold plate tubing effect; data center; disk storage unit; energy efficiency; interboard air-to-water heat exchanger; mechanical analysis; mechanical design; module LGA loading; module cold plate designs; module level structural analyses; module thermal interfaces; module-board land grid array load distribution; optical I/O hub module thermal design; performance computing platform; power 775 server drawer; power conversion; processor modules cooling; room air conditioning; temperature 20 degC to 30 degC; thermal analysis; total system heat load; water cooled cold plates; water cooled memory design; Assembly; Cold plates; Manifolds; Servers; Water heating;
Conference_Titel :
Thermal and Thermomechanical Phenomena in Electronic Systems (ITherm), 2012 13th IEEE Intersociety Conference on
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4244-9533-7
Electronic_ISBN :
1087-9870
DOI :
10.1109/ITHERM.2012.6231496