• DocumentCode
    3653741
  • Title

    A multi-layer software-based fault-tolerance approach for heterogenous multi-core systems

  • Author

    S. M?ller;T. Koal;S. Scharoba;H.T. Vierhaus;M. Sch?lzel

  • Author_Institution
    Brandenburg University of Technology, Cottbus, Germany
  • fYear
    2015
  • fDate
    3/1/2015 12:00:00 AM
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper describes a software-based technique for building heterogeneous fault tolerant multi-core systems, which are able to handle temporary and permanent hardware faults autonomously in two system layers. The fault tolerance technique relies on a single concept for adapting the binary code of the user application to the current fault state of a single core. Thereby this scheme is used either for a local repair of each core or for a global repair. By the global repair, the task assigned to a faulty core may be rescheduled to another core that provides enough resources for the execution of the task. Thereby the local repair scheme is reused for the adaptation of the rescheduled task. It is shown that the reliability of a multi-core system can be improved significantly, when using the global repair together with the local repair instead of using the local repair only.
  • Keywords
    "Maintenance engineering","Registers","Multicore processing","Redundancy","Hardware","Binary codes","Multiprocessor interconnection"
  • Publisher
    ieee
  • Conference_Titel
    Test Symposium (LATS), 2015 16th Latin-American
  • ISSN
    2373-0862
  • Type

    conf

  • DOI
    10.1109/LATW.2015.7102508
  • Filename
    7102508