Title :
Monitoring reliability in embedded processors - A multi-layer view
Author_Institution :
ARM R&D, San Jose, CA, USA
Abstract :
Scaling to sub-20nm technology nodes changes the nature of reliability effects from abrupt functional problems to progressive degradation of the performance characteristics of devices and system components. Further, application workloads can significantly affect the overall system reliability. In this work, we have analyzed aging effects on various design hierarchies of an embedded commercial processor in 28nm running real-world applications. We have also quantified the dependencies of aging effects on switching-activity and power-state of workloads. Implementation results show that the processor timing degradation can vary from 2% to 11%, depending on the workload. Due to the dependence of aging on the application workloads, margin based design will be highly pessimistic. We propose an efficient and flexible in situ monitoring methodology, SlackProbe, which inserts timing monitors at both path endpoints and path intermediate nets. We show that SlackProbe reduces the numbers of monitors required by over 15X with ~5% additional delay margin in several commercial processor benchmarks. The real-time data from these monitors can be used for hardware and software adaptation to mitigate failures due to aging.
Keywords :
embedded systems; failure analysis; performance evaluation; SlackProbe; abrupt functional problems; aging effects; application workloads; delay margin; design hierarchies; embedded processor reliability monitoring; failure mitigation; hardware adaptation; in-situ monitoring methodology; margin-based design; path endpoints; path intermediate nets; processor benchmarks; processor timing degradation; progressive performance characteristics degradation; real-world applications; software adaptation; switching-activity; timing monitors; workload power-state; Abstracts; Digital audio players; Logic gates; Monitoring; Switches; Timing; Transistors;
Conference_Titel :
Design Automation Conference (DAC), 2014 51st ACM/EDAC/IEEE
Conference_Location :
San Francisco, CA
DOI :
10.1145/2593069.2596682