DocumentCode :
3471946
Title :
A flexible state-saving library for message-passing systems
Author :
Clematis, A. ; Deconinck, G. ; Gianuzzi, V.
Author_Institution :
Ist. per la Matematica Applicata, CNR, Genova, Italy
fYear :
1998
fDate :
21-23 Jan 1998
Firstpage :
335
Lastpage :
341
Abstract :
Message passing applications on a distributed computer require tools to integrate state saving and rollback, to support dynamic program reconfiguration, fault tolerance and others. The paper presents the results of integrating two independently developed tools that combine flexibility and portability. The User-Triggered CheckPointing (UTCP) provides checkpointing and recovery while relying on the programmer to indicate the position of the recovery line and the contents of the checkpoint. The tool PVMsnap provides an extension to PVM to obtain a consistent cut of the message passing application. The combination of both tools results in a portable and flexible solution for fault tolerance which can be adapted to the applications' needs.
Keywords :
message passing; program diagnostics; software fault tolerance; software libraries; system recovery; virtual machines; PVM; User-Triggered CheckPointing; distributed computer; dynamic program reconfiguration; fault tolerance; flexible solution; flexible state saving library; independently developed tools; message passing applications; message passing systems; recovery line; rollback; state saving; tool PVMsnap; Application software; Checkpointing; Computer applications; Distributed computing; Fault tolerance; Kernel; Libraries; Message passing; Programming profession; Protocols;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Parallel and Distributed Processing, 1998. PDP '98. Proceedings of the Sixth Euromicro Workshop on
Conference_Location :
Madrid
Print_ISBN :
0-8186-8332-5
Type :
conf
DOI :
10.1109/EMPDP.1998.647217
Filename :
647217