DocumentCode :
2073327
Title :
Cost-effective safety and fault localization using distributed temporal redundancy
Author :
Meyer, Brett H. ; Calhoun, Benton H. ; Lach, John ; Skadron, Kevin
Author_Institution :
Comput. Sci., Univ. of Virginia, Charlottesville, VA, USA
fYear :
2011
fDate :
9-14 Oct. 2011
Firstpage :
125
Lastpage :
134
Abstract :
Cost pressure is driving vendors of safety-critical systems to integrate previously distributed systems. One natural approach we have previous introduced is On-Demand Redundancy (ODR), which allows safety-critical and non-critical tasks, traditionally isolated to limit interference, to execute on shared resources. Our prior work has shown that relaxed dedication (RD), one ODR strategy which allows non-critical tasks (NCTs) to execute on idle critical task resources (CTRs), significantly increases NCT throughput. Unfortunately, there are circumstances under which, in spite of this opportunity, it is difficult to effectively schedule NCTs. In this paper, we introduce distributed temporal redundancy (DTR), which allows critical tasks, which traditionally execute in lockstep, to execute asynchronously. In doing so, DTR increases scheduling flexibility, resulting in systems that achieve much closer to the optimal NCT throughput than with relaxed dedication alone; in one set of experiments, DTR schedules no less 93% of the theoretical NCT cycles across a variety of synthetic benchmarks, outperforming RD by over 11%, on average. Furthermore, by distributing all redundant tasks across different resources, triple-modular redundancy, and therefore fault localization, can be achieved. We demonstrate that this can be accomplished with little additional cost and complexity: in practice, relatively few DTR tasks are in fight simultaneously, limiting the additional buffering needed to support DTR.
Keywords :
benchmark testing; fault tolerant computing; processor scheduling; redundancy; resource allocation; DTR; ODR strategy; cost-effective safety; critical task resources; distributed temporal redundancy; fault localization; noncritical tasks; ondemand redundancy; optimal NCT throughput; relaxed dedication; resource sharing; safety-critical system; scheduling flexibility; synthetic benchmarks; triple-modular redundancy; Hardware; Interference; Processor scheduling; Redundancy; Schedules; Throughput; Safety-critical; on-demand redundancy; system-level design;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Compilers, Architectures and Synthesis for Embedded Systems (CASES), 2011 Proceedings of the 14th International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4503-0713-0
Type :
conf
Filename :
6062038
Link To Document :
بازگشت