• DocumentCode
    1075989
  • Title

    Design Optimization of Time- and Cost-Constrained Fault-Tolerant Embedded Systems With Checkpointing and Replication

  • Author

    Pop, Paul ; Izosimov, Viacheslav ; Eles, Petru ; Peng, Zebo

  • Author_Institution
    Dept. of Inf. & Math. Modelling, Tech. Univ. of Denmark, Kongens Lyngby
  • Volume
    17
  • Issue
    3
  • fYear
    2009
  • fDate
    3/1/2009 12:00:00 AM
  • Firstpage
    389
  • Lastpage
    402
  • Abstract
    We present an approach to the synthesis of fault-tolerant hard real-time systems for safety-critical applications. We use checkpointing with rollback recovery and active replication for tolerating transient faults. Processes and communications are statically scheduled. Our synthesis approach decides the assignment of fault-tolerance policies to processes, the optimal placement of checkpoints and the mapping of processes to processors such that multiple transient faults are tolerated and the timing constraints of the application are satisfied. We present several design optimization approaches which are able to find fault-tolerant implementations given a limited amount of resources. The developed algorithms are evaluated using extensive experiments, including a real-life example.
  • Keywords
    checkpointing; embedded systems; fault tolerant computing; microprocessor chips; processor scheduling; active replication; checkpointing; embedded systems; fault tolerance; processor scheduling; real-time systems; rollback recovery; transient faults; Fault tolerance; processor scheduling; real time systems; redundancy;
  • fLanguage
    English
  • Journal_Title
    Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-8210
  • Type

    jour

  • DOI
    10.1109/TVLSI.2008.2003166
  • Filename
    4757196