DocumentCode :
2306515
Title :
Recovery of fault-tolerant real-time scheduling algorithm for tolerating multiple transient faults
Author :
Pathan, Risat Mahmud
Author_Institution :
Dept. of Comput. Sci. & Eng., BRAC Univ., Dhaka
fYear :
2007
fDate :
27-29 Dec. 2007
Firstpage :
1
Lastpage :
6
Abstract :
Missing a deadline in a hard real-time system can have catastrophic consequences. When faults occur, a deadline can be missed if the time needed for recovery is not taken into account when tasks are submitted to or accepted by the system. Even when fault tolerance is employed, tasks may still miss their deadlines: if an erroneous task with a large execution time runs to the end of its total execution time even though the error was detected early, this unnecessary execution leaves no additional slack in the schedule for running another copy of the same task without missing the deadline. This paper proposes a recovery mechanism to augment the fault-tolerant real-time scheduling algorithm RM-FT, which achieves node-level fault tolerance (NLFT) using the temporal error masking (TEM) technique on top of the rate monotonic (RM) scheduling algorithm. Several hardware and software error detection mechanisms (EDMs), e.g. watchdog processors or executable assertions, can detect an error before an erroneous task finishes its full execution and can immediately stop its execution. Taking advantage of such early detection, a recovery algorithm, RM-FT-RECOVERY, is proposed to find an upper bound, denoted Edm Bound, on the execution time of the tasks, and a mechanism is developed to provide additional slack time in a fault-tolerant real-time schedule so that additional task copies can be scheduled when an error occurs.
Keywords :
fault tolerant computing; real-time systems; scheduling; fault-tolerant real-time scheduling algorithm; larger execution time; multiple transient faults; node level fault tolerance; rate monotonic scheduling algorithm; recovery mechanism; software error detection mechanisms; temporal error masking; Decision support systems; Fault tolerance; Scheduling algorithm; Virtual reality; EDM; Fixed-Priority Scheduling; NLFT; Real-Time Fault-Tolerant systems; TEM;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer and Information Technology, 2007. ICCIT 2007. 10th International Conference on
Conference_Location :
Dhaka
Print_ISBN :
978-1-4244-1550-2
Electronic_ISBN :
978-1-4244-1551-9
Type :
conf
DOI :
10.1109/ICCITECHN.2007.4579369
Filename :
4579369