DocumentCode
3354966
Title
Scheduling message processing for reducing rollback propagation
Author
Wang, Y.-M. ; Fuchs, W.K.
Author_Institution
Coordinated Sci. Lab., Illinois Univ., Urbana, IL, USA
fYear
1992
fDate
8-10 July 1992
Firstpage
204
Lastpage
211
Abstract
The authors show that the probability of rollback propagation in a message-passing system can often be greatly reduced by reordering the processing of messages. Also, rollback propagation was measured for several parallel programs. A scheduling algorithm for message processing and its implementation for reducing rollback propagation are described. The algorithm incorporates a user-transparent prioritized scheme based on the run-time communication and checkpointing history. Communication trace-driven simulation for several parallel programs written in the Chare Kernel language demonstrated that the probability of rollback propagation can be reduced at the cost of slight additional performance degradation.
Keywords
fault tolerant computing; message passing; parallel programming; scheduling; Chare Kernel language; checkpointing; message processing scheduling; parallel programs; rollback propagation; run-time communication; trace-driven simulation; user-transparent prioritized scheme; Checkpointing; Costs; Degradation; History; Kernel; NASA; Processor scheduling; Runtime; Scheduling algorithm; Synchronization
fLanguage
English
Publisher
IEEE
Conference_Titel
Fault-Tolerant Computing, 1992. FTCS-22. Digest of Papers., Twenty-Second International Symposium on
Conference_Location
Boston, MA, USA
Print_ISBN
0-8186-2875-8
Type
conf
DOI
10.1109/FTCS.1992.243599
Filename
243599