DocumentCode
292320
Title
Implementing halt on failure processors
Author
Macdonald, R.N. ; Shoja, G.C.
Author_Institution
Dept. of Comput. Sci., Victoria Univ., BC, Canada
Volume
1
fYear
1993
fDate
19-21 May 1993
Firstpage
272
Abstract
The problem of detecting and masking failed processes in a distributed processing environment is considered. The authors propose a virtual halt-on-failure processor in which replicated processes are used to achieve fault tolerance. Processor failures are detected and masked up to a certain limit. Once the threshold of permissible node failures is exceeded, the virtual processor reports the failure and halts. The authors contend that this is more practical and efficient than the generally assumed fail-stop processor. Results of an implementation in the REM (Remote Execution Manager) environment are presented.
Keywords
distributed processing; fault tolerant computing; virtual machines; Remote Execution Manager; fault tolerance; halt on failure processors; processor failure masking; replicated processes; virtual processor; Computer errors; Computer science; Distributed processing; Fault detection; Fault tolerance; Fault tolerant systems; Scholarships; Time factors; Timing; Workstations
fLanguage
English
Publisher
IEEE
Conference_Titel
Communications, Computers and Signal Processing, 1993., IEEE Pacific Rim Conference on
Conference_Location
Victoria, BC
Print_ISBN
0-7803-0971-5
Type
conf
DOI
10.1109/PACRIM.1993.407171
Filename
407171