DocumentCode
154146
Title
Software-Managed Power Reduction in Infiniband Links
Author
Dickov, Branimir ; Pericas, Miquel ; Carpenter, Paul ; Navarro, Nacho ; Ayguade, Eduard
fYear
2014
fDate
9-12 Sept. 2014
Firstpage
311
Lastpage
320
Abstract
The backbone of a large-scale supercomputer is the interconnection network. As compute nodes become more energy-efficient, the interconnect is accounting for an increasing proportion of the total system energy consumption. The interconnect´s energy consumption is, however, only starting to receive serious attention. Some hardware-based schemes have been proposed that exploit idle periods or low utilisation, either by turning off the links or by lowering the frequency and voltage. Although these schemes are effective in certain cases, they do not have enough global information about the application´s communication behaviour to efficiently manage the network power consumption. This paper proposes an alternative approach: moving the intelligence into the PMPI layer of the MPI library, and using prediction to discover repetitive patterns in the application´s communication behaviour. The core of the prediction algorithm is an n-gram extraction technique, which can accurately predict not only when a link will become unused but also when it will become active again, allowing lanes to be switched off during the idle periods and switched back on again in time to avoid incurring a significant performance degradation. Many HPC applications benefit from prediction, since they have repetitive computation and communication phases. By implementing the energy-saving mechanism inside the MPI library, existing MPI programs do not need to be modified. Using an event-driven simulator, driven by representative HPC workloads, we demonstrate average energy savings in Infiniband switches up to around 33%, while the average execution time increase is only up to 1%.
Keywords
application program interfaces; discrete event simulation; interconnections; mainframes; power aware computing; power consumption; HPC applications; HPC workloads; Infiniband links; Infiniband switches; MPI library; MPI programs; PMPI layer; communication behaviour; compute nodes; energy-saving mechanism; event-driven simulator; global information; interconnection network; large-scale supercomputer; n-gram extraction technique; network power consumption; prediction algorithm; total system energy consumption; Arrays; Energy consumption; Pattern matching; Power demand; Prediction algorithms; Program processors; Supercomputers;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing (ICPP), 2014 43rd International Conference on
Conference_Location
Minneapolis MN
ISSN
0190-3918
Type
conf
DOI
10.1109/ICPP.2014.40
Filename
6957240
Link To Document