Title :
FIMD-MPI: a tool for injecting faults into MPI applications
Author :
Blough, Douglas M. ; Liu, Peng
Author_Institution :
Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Abstract :
Parallel computing is seeing increasing use in critical applications. The need therefore arises to test the robustness of parallel applications in the presence of exceptional conditions, or faults. Communication-software-based fault injection is an extremely flexible approach to robustness testing in message-passing parallel computers. A fault injection methodology and tool that use this approach are presented. The tool, known as FIMD-MPI, allows injection of faults into MPI-based applications. The structure and operation of FIMD-MPI are described, and the use of the tool is illustrated on an example fault-tolerant MPI application.
Keywords :
fault tolerant computing; message passing; parallel processing; FIMD-MPI; MPI application; communication-software-based fault injection; faults injection; message-passing parallel computers; parallel computing; robustness testing; Application software; Clocks; Computer errors; Concurrent computing; Error correction; Fault tolerance; Parallel algorithms; Synchronization; Testing; Timing
Conference_Title :
Parallel and Distributed Processing Symposium, 2000. IPDPS 2000. Proceedings. 14th International
Conference_Location :
Cancun
Print_ISBN :
0-7695-0574-0
DOI :
10.1109/IPDPS.2000.845991