• DocumentCode
    292320
  • Title

    Implementing halt on failure processors

  • Author

    Macdonald, R.N. ; Shoja, G.C.

  • Author_Institution
    Dept. of Comput. Sci., Victoria Univ., BC, Canada
  • Volume
    1
  • fYear
    1993
  • fDate
    19-21 May 1993
  • Firstpage
    272
  • Abstract
    The problem of detecting and masking failed processes in a distributed processing environment is considered. The authors propose a virtual halt on failure processor where replicated processes are used to achieve fault tolerance. Processor failures are detected and masked up to a certain limit. Once the threshold of permissible node failures is exceeded, the virtual processor reports the failure and halts. The authors contend that this is more practical and efficient than the generally assumed fail-stop processor. Results of an implementation in the REM (Remote Execution Manager) environment are presented
  • Keywords
    distributed processing; fault tolerant computing; virtual machines; Remote Execution Manager; distributed processing; fault tolerance; halt on failure processors; processor failure masking; replicated processes; virtual processor; Computer errors; Computer science; Distributed processing; Fault detection; Fault tolerance; Fault tolerant systems; Scholarships; Time factors; Timing; Workstations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications, Computers and Signal Processing, 1993., IEEE Pacific Rim Conference on
  • Conference_Location
    Victoria, BC
  • Print_ISBN
    0-7803-0971-5
  • Type

    conf

  • DOI
    10.1109/PACRIM.1993.407171
  • Filename
    407171