• DocumentCode
    2234409
  • Title

    Tolerating transient communication faults with online traffic scheduling

  • Author

    Marques, Luis ; Vasconcelos, Verónica ; Pedreiras, Paulo ; Almeida, Luis

  • Author_Institution
    ISEC, IPC, Portugal
  • fYear
    2012
  • fDate
    19-21 March 2012
  • Firstpage
    396
  • Lastpage
    402
  • Abstract
    Building distributed embedded systems that will be fault-free for all their lifetime is virtually impossible, thus the systems must deal with them if a continued correct behavior is needed. This is the case of safety-critical systems, such as X-by-wire systems in the automotive domain. Concerning transient communication faults in particular, they can be dealt with at various levels of the protocol stacks, with different techniques, e.g., temporal and spatial redundancy. In this paper we focus on temporal redundancy and we address the limitations imposed by typical time-triggered systems, commonly found in safety-critical systems, arising from their static traffic definition. In these systems the use of temporal redundancy to handle communication errors requires the pre-allocation of communication resources that, in the absence of errors, are wasted. Therefore, we propose an online traffic scheduling approach in which retransmissions are consistently scheduled with the remaining time-triggered traffic, using the unique flexibility provided by the FTT-CAN protocol (Flexible Time-Triggered communication on CAN). We address the integration of appropriate fault detectors in the FTT-CAN protocol to monitor the bus activity and re-schedule omitted messages. We show that this approach is more efficient than the static allocations, since communication resources are only allocated when necessary. We also discuss alternative realizations and validate the approach with initial results from a prototype implementation.
  • Keywords
    controller area networks; distributed processing; embedded systems; fault tolerant computing; protocols; scheduling; FTT-CAN protocol; X-by-wire systems; distributed embedded systems; flexible time-triggered communication; online traffic scheduling; safety-critical systems; temporal redundancy; time-triggered systems; transient communication fault tolerance; Reliability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Industrial Technology (ICIT), 2012 IEEE International Conference on
  • Conference_Location
    Athens
  • Print_ISBN
    978-1-4673-0340-8
  • Type

    conf

  • DOI
    10.1109/ICIT.2012.6209970
  • Filename
    6209970