DocumentCode :
1554996
Title :
Replicated process allocation for load distribution in fault-tolerant multicomputers
Author :
Kim, Jong ; Lee, Heejo ; Lee, Sunggu
Author_Institution :
Dept. of Comput. Sci. & Eng., Pohang Univ. of Sci. & Technol., South Korea
Volume :
46
Issue :
4
fYear :
1997
fDate :
4/1/1997
Firstpage :
499
Lastpage :
505
Abstract :
In this paper, we consider a load-balancing process allocation method for fault-tolerant multicomputer systems that balances the load both before and after faults start to degrade the performance of the system. To tolerate a single fault, each process (the primary process) is duplicated (i.e., has a backup process). The backup process executes on a different processor from the primary, checkpointing the primary process and recovering the process if the primary process fails. We formalize the problem of load-balancing process allocation, propose a new process allocation method, and analyze its performance. Simulations are used to compare the proposed method with a process allocation method that does not take into account the different load characteristics of the primary and backup processes. While both methods perform well before the occurrence of a fault, only the proposed method maintains a balanced load after the occurrence of such a fault.
Keywords :
digital simulation; fault tolerant computing; multiprocessing systems; performance evaluation; resource allocation; backup process; fault-tolerant multicomputers; load characteristics; load distribution; load-balancing process allocation; performance; replicated process allocation; simulations; Checkpointing; Degradation; Distributed computing; Fault tolerance; Fault tolerant systems; Load management; Performance analysis; Real time systems;
fLanguage :
English
Journal_Title :
Computers, IEEE Transactions on
Publisher :
IEEE
ISSN :
0018-9340
Type :
jour
DOI :
10.1109/12.588067
Filename :
588067