Title :
A Fault-Tolerant Low-Energy Multi-Application Mapping onto NoC-based Multiprocessors
Author :
Khalili, F. ; Zarandi, Hamid Reza
Author_Institution :
Dept. of Comput. Eng. & Inf. Technol., Amirkabir Univ. of Technol., Tehran, Iran
Abstract :
This paper proposes a fault-tolerant multi-application mapping technique in NoC-based multiprocessor platforms. The proposed mapping technique is composed of two main parts: 1) Mapping an application core graph to a free non-faulty processing cores, 2) Placing spare cores among other free non-faulty processing cores. The former is a heuristic algorithm to map application core graph onto the platform to achieve higher performance and lower communication energy than previous techniques. The latter tunes the location of spare cores based on each application core graph, considering both transient and permanent core failures. With a good spare core placement, not only resource management is well performed, but also the failure containment is improved within the system. Many application core graphs generated by TGFF are used to evaluate the proposed technique. The simulations are performed using cycle-accurate Noxim simulator. The results of 1,000,000 fault injection experiments show that with confidence level of 95%, the proposed technique leads to communication energy reductions and performance improvement, compared to well-known related works.
Keywords :
circuit simulation; failure analysis; fault tolerant computing; graph theory; low-power electronics; multiprocessing systems; network-on-chip; performance evaluation; resource allocation; NoC-based multiprocessor platforms; TGFF; application core graph mapping; communication energy reductions; cycle-accurate Noxim simulator; failure containment improvement; fault injection experiments; fault tolerant low-energy multiapplication mapping; free nonfaulty processing cores; heuristic algorithm; performance improvement; permanent core failures; resource management; spare core placement; transient failures; Circuit faults; Energy consumption; Fault tolerance; Fault tolerant systems; Heuristic algorithms; Scheduling; Tiles; Application Mapping; Dynamic Spare Placement; Failure Recovery; Fault-Tolerant; Task Migration;
Conference_Titel :
Computational Science and Engineering (CSE), 2012 IEEE 15th International Conference on
Conference_Location :
Nicosia
Print_ISBN :
978-1-4673-5165-2
Electronic_ISBN :
978-0-7695-4914-9
DOI :
10.1109/ICCSE.2012.65