  • DocumentCode
    3359262
  • Title
    On-demand thread-level fault detection in a concurrent programming environment
  • Author
    Jian Fu; Qiang Yang; Raphael Poss; Chris R. Jesshope; Chunyuan Zhang
  • Author_Institution
    Informatics Institute, University of Amsterdam, Amsterdam, Netherlands
  • fYear
    2013
  • fDate
    15-18 July 2013
  • Firstpage
    255
  • Lastpage
    262
  • Abstract
    The vulnerability of multi-core processors is increasing due to tighter design margins and greater susceptibility to interference. Moreover, concurrent programming environments are the norm in the exploitation of multi-core systems. In this paper, we present an on-demand thread-level fault detection mechanism for multi-cores. The main contribution is on-demand redundancy, which allows users to set the redundancy scope in the concurrent code. To achieve this we introduce intelligent redundant thread creation and synchronization, which manages concurrency and synchronization between the redundant threads via the master. This framework was implemented in an emulation of a multi-threaded, many-core processor with single, in-order issue cores. It was evaluated with a range of programs in image and signal processing, and encryption. The performance overhead of redundancy is less than 11% for single core execution and is always less than 100% for all scenarios. This efficiency derives from the platform's hardware concurrency management and latency tolerance.
  • Keywords
    concurrency control; cryptography; fault diagnosis; image processing; multi-threading; multiprocessing systems; performance evaluation; redundancy; signal processing; software fault tolerance; synchronisation; concurrent code; concurrent programming environments; encryption; hardware concurrency management; intelligent redundant thread creation and synchronization; latency tolerance; multicore processor vulnerability; multithreaded many-core processor emulation; on-demand thread-level fault detection mechanism; redundancy performance overhead; redundancy scope; single core execution; single in-order issue cores; synchronization; Concurrent computing; Instruction sets; Message systems; Programming; Redundancy; Synchronization
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XIII), 2013 International Conference on
  • Conference_Location
    Agios Konstantinos
  • Type
    conf
  • DOI
    10.1109/SAMOS.2013.6621132
  • Filename
    6621132