DocumentCode
1138392
Title
Process recovery in heterogeneous systems
Author
Ssu, Kuo-Feng ; Fuchs, W. Kent ; Jiau, Hewijin C.
Author_Institution
Dept. of Electr. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Volume
52
Issue
2
fYear
2003
Firstpage
126
Lastpage
138
Abstract
Heterogeneous computing environments, where computers may have different instruction set architectures, data representations, and operating systems, complicate checkpointing and recovery of processes. This paper describes an approach to recovery and an implementation, PREACHES, that provides portable checkpointing of single-process applications in heterogeneous systems using checkpoint propagation. The checkpoint propagation mechanism creates machine-dependent checkpoints for different architectures in the heterogeneous environment. A process is restored on a specific machine with the checkpoint that is appropriate for the architecture. An implementation of PREACHES has been evaluated on a heterogeneous network of workstations, including Sun, HP, and Pentium machines. The experimental results show that PREACHES achieves efficient checkpointing and rapid recovery.
Keywords
fault tolerant computing; protocols; system recovery; PREACHES; checkpointing; data representations; heterogeneous systems; instruction set architectures; machine-dependent checkpoints; operating systems; process recovery; Application software; Checkpointing; Computer aided instruction; Computer architecture; Data conversion; Data mining; Delay; Operating systems; Sun; Workstations;
fLanguage
English
Journal_Title
Computers, IEEE Transactions on
Publisher
ieee
ISSN
0018-9340
Type
jour
DOI
10.1109/TC.2003.1176981
Filename
1176981
Link To Document