DocumentCode
3471946
Title
A flexible state-saving library for message-passing systems
Author
Clematis, A. ; Deconinck, G. ; Gianuzzi, V.
Author_Institution
Ist. per la Matematica Applicata, CNR, Genova, Italy
fYear
1998
fDate
21-23 Jan 1998
Firstpage
335
Lastpage
341
Abstract
Message-passing applications on a distributed computer require tools that integrate state saving and rollback in order to support dynamic program reconfiguration, fault tolerance, and other features. The paper presents the results of integrating two independently developed tools that combine flexibility and portability. User-Triggered CheckPointing (UTCP) provides checkpointing and recovery while relying on the programmer to indicate the position of the recovery line and the contents of the checkpoint. PVMsnap extends PVM to obtain a consistent cut of the message-passing application. Combining the two tools yields a portable and flexible fault-tolerance solution that can be adapted to the applications' needs.
Keywords
message passing; program diagnostics; software fault tolerance; software libraries; system recovery; virtual machines; PVM; User-Triggered CheckPointing; distributed computer; dynamic program reconfiguration; fault tolerance; flexible solution; flexible state saving library; independently developed tools; message passing applications; message passing systems; recovery line; rollback; state saving; tool PVMsnap; Application software; Checkpointing; Computer applications; Distributed computing; Fault tolerance; Kernel; Libraries; Message passing; Programming profession; Protocols;
fLanguage
English
Publisher
IEEE
Conference_Titel
Parallel and Distributed Processing, 1998. PDP '98. Proceedings of the Sixth Euromicro Workshop on
Conference_Location
Madrid
Print_ISBN
0-8186-8332-5
Type
conf
DOI
10.1109/EMPDP.1998.647217
Filename
647217