Title :
Incorporation of optimal timeouts into distributed real-time load sharing
Author :
Hou, Chao-Ju ; Shin, Kang G.
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Michigan Univ., Ann Arbor, MI, USA
Abstract :
The authors address the problem of designing and incorporating a timeout mechanism into load sharing (LS) with state-region change broadcasts in the presence of node failures in a distributed real-time system. They formulate the problem of determining the best timeout period Tout<i> for node i as a hypothesis testing problem, and maximize the probability of detecting node failures subject to a prespecified probability of falsely diagnosing a healthy node as faulty. They outline the LS algorithm and the proposed timeout mechanism and establish a theoretical basis for the calculation of optimal Tout<i>. The simulation results show that the LS algorithm, which combines online parameter estimation, the timeout mechanism, and a few extra, timely broadcasts, can significantly reduce the probability of missing task deadlines
Keywords :
digital simulation; distributed processing; parameter estimation; real-time systems; resource allocation; distributed real-time load sharing; hypothesis testing problem; load sharing; node failures; online parameter estimation; optimal timeouts; simulation; state-region change broadcasts; timely broadcasts; timeout mechanism; Bayesian methods; Broadcasting; Chaotic communication; Distributed computing; Fault detection; Laboratories; Parameter estimation; Real time systems; Testing; Time sharing computer systems;
Conference_Titel :
System Sciences, 1993, Proceeding of the Twenty-Sixth Hawaii International Conference on
Conference_Location :
Wailea, HI
Print_ISBN :
0-8186-3230-5
DOI :
10.1109/HICSS.1993.284065