Title :
Generalized Standby-Sparing techniques for energy-efficient fault tolerance in multiprocessor real-time systems
Author :
Yifeng Guo ; Dakai Zhu ; Aydin, Hakan
Author_Institution :
Univ. of Texas at San Antonio, San Antonio, TX, USA
Abstract :
The Standby-Sparing (SS) technique has been previously explored to improve energy efficiency while providing fault tolerance in dual-processor real-time systems. In this paper, by considering both transient and permanent faults, we develop energy-efficient fault tolerance techniques for real-time systems deploying an arbitrary number of identical processors. First, we study the Paired-SS technique, where processors are organized as groups of two (i.e., pairs) and SS is applied within each pair of processors directly after partitioning tasks to the pairs. Then, we propose a Generalized-SS technique that partitions processors into two groups containing primary and secondary processors, respectively. The main and backup copies of tasks are executed on the primary and secondary processors under the partitioned-EDF and partitioned-EDL scheduling policies, respectively. The objective is to reduce the overlapped executions of the main and backup copies in order to improve energy savings. Our experimental evaluations show that, for a given system with fixed number of processors, typically there exists a configuration of primary and secondary processors under the Generalized-SS technique that can lead to better energy savings when compared to the Paired-SS technique.
Keywords :
fault tolerant computing; multiprocessing systems; power aware computing; real-time systems; scheduling; arbitrary identical processors; backup copies; dual-processor real-time systems; earliest-deadline-first policy; earliest-deadline-late policy; energy efficiency improvement; energy saving improvement; energy-efficient fault tolerance techniques; generalized standby-sparing techniques; generalized-SS technique; main copies; multiprocessor real-time systems; overlapped execution reduction; paired-SS technique; partitioned-EDF scheduling policy; partitioned-EDL scheduling policy; permanent faults; primary processors; processor partitioning; secondary processors; task partitioning; transient faults; Circuit faults; Energy consumption; Program processors; Real-time systems; Reliability; Transient analysis; Voltage control;
Conference_Titel :
Embedded and Real-Time Computing Systems and Applications (RTCSA), 2013 IEEE 19th International Conference on
Conference_Location :
Taipei
DOI :
10.1109/RTCSA.2013.6732204