DocumentCode :
2172986
Title :
Fault-tolerant average execution time optimization for general-purpose multi-processor system-on-chips
Author :
Väyrynen, Mikael ; Singh, Virendra ; Larsson, Erik
Author_Institution :
Dept. of Comput. Sci., Linköping Univ., Linköping
fYear :
2009
fDate :
20-24 April 2009
Firstpage :
484
Lastpage :
489
Abstract :
Owing to developments in semiconductor technology, fault tolerance is important not only for safety-critical systems but also for general-purpose (non-safety-critical) systems. For general-purpose systems, however, the important goal is not to guarantee that deadlines are always met but to minimize the average execution time (AET) while ensuring fault tolerance. For a given job and a soft (transient) error probability, we define mathematical formulas for AET that include bus communication overhead for both voting (active replication) and rollback-recovery with checkpointing (RRC). For a given multi-processor system-on-chip (MPSoC), we also define integer linear programming (ILP) models that minimize AET, including bus communication overhead, when: (1) selecting the number of checkpoints when using RRC, (2) finding the number of processors and the job-to-processor assignment when using voting, and (3) selecting the fault-tolerance scheme (voting or RRC) for each job and defining its usage for that job. Experiments demonstrate significant savings in AET.
Keywords :
checkpointing; fault tolerance; integer programming; linear programming; microprocessor chips; probability; safety-critical software; system buses; system-on-chip; bus communication overhead; fault-tolerance; fault-tolerant average execution time optimization; general-purpose multiprocessor system-on-chips; general-purpose systems; integer linear programming models; job-to-processor assignment; rollback-recovery with checkpointing; safety-critical systems; semiconductor technology development; soft error probability; voting active replication; Checkpointing; Circuit faults; Error probability; Fault detection; Fault tolerance; Fault tolerant systems; Integer linear programming; Production; System-on-a-chip; Voting;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Design, Automation & Test in Europe Conference & Exhibition, 2009. DATE '09.
Conference_Location :
Nice
ISSN :
1530-1591
Print_ISBN :
978-1-4244-3781-8
Type :
conf
DOI :
10.1109/DATE.2009.5090713
Filename :
5090713