Title :
Tackling QoS-induced aging in exascale systems through agile path selection
Author :
Ancajas, D.M. ; Chakraborty, K. ; Roy, S. ; Allred, J.
Author_Institution :
USU BRIDGE Lab., Utah State Univ., Logan, UT, USA
Abstract :
Network-on-Chips (NoCs) have become the standard communication platform for future massively parallel systems due to their performance, flexibility and scalability advantages. However, reliability issues brought about by scaling in the sub-20nm era threaten to undermine the benefits offered by NoCs. In this paper, we show that QoS policies worsen the reliability profile of an exascale system. To mitigate this imposing challenge, we propose Dynamic Wearout Resilient Routing (DWRR) algorithms in QoS-enabled exascale NoCs. Our proposal includes two novel DWRR algorithms enabled by a critical-path monitor and a broadcast-based routing configuration. Using PARSEC benchmarks, our best algorithm improves QoS and long-term sustainability (Mean Time To Failure) of the system by an average of 16% and 25%, respectively, compared to a state-of-the-art fault-tolerant technique.
Keywords :
ageing; fault tolerant computing; network-on-chip; parallel programming; quality of service; reliability; DWRR algorithms; NoC; PARSEC benchmarks; QoS; agile path selection; aging; broadcast-based routing; dynamic wearout resilient routing algorithms; exascale systems; fault tolerant technique; network-on-chips; parallel systems; reliability; Bandwidth; Degradation; Delays; Heuristic algorithms; Quality of service; Reliability; Routing;
Conference_Title :
Hardware/Software Codesign and System Synthesis (CODES+ISSS), 2014 International Conference on
Conference_Location :
New Delhi
DOI :
10.1145/2656075.2656100