DocumentCode
3172095
Title
PREACHES-portable recovery and checkpointing in heterogeneous systems
Author
Kuo-Feng Ssu ; Fuchs, W.K.
Author_Institution
Coordinated Sci. Lab., Illinois Univ., Urbana, IL, USA
fYear
1998
fDate
23-25 June 1998
Firstpage
38
Lastpage
47
Abstract
Checkpointing in a homogeneous environment, where both checkpointing and recovery are performed on the same type of machine and operating system, has been studied extensively. As heterogeneous distributed systems become pervasive, it is desirable to extend the capability of checkpointing to non-homogeneous environments. This paper describes a prototype, PREACHES, that achieves portable checkpointing of single process applications in heterogeneous systems using checkpoint propagation. The checkpoint propagation technique generates machine-dependent checkpoints for each different architecture in the heterogeneous environment. When failure occurs, the failed process can be restarted on a specified machine with the checkpoint that is appropriate for the architecture. An implementation of PREACHES on a heterogeneous network of workstations has been successfully developed based on TCP/IP communication. PREACHES also provides automatic and fast recovery for single process programs.
Keywords
distributed processing; local area networks; software fault tolerance; software portability; system recovery; transport protocols; PREACHES; TCP/IP; checkpoint propagation; checkpointing; heterogeneous distributed systems; operating system; single process applications; system recovery; workstation network; Checkpointing; Computer architecture; Contracts; Identity-based encryption; Instruction sets; Operating systems; Protocols; Prototypes; Registers; TCPIP;
fLanguage
English
Publisher
ieee
Conference_Titel
Fault-Tolerant Computing, 1998. Digest of Papers. Twenty-Eighth Annual International Symposium on
Conference_Location
Munich, Germany
ISSN
0731-3071
Print_ISBN
0-8186-8470-4
Type
conf
DOI
10.1109/FTCS.1998.689453
Filename
689453
Link To Document