• DocumentCode
    1152680
  • Title

    Distributed Recovery in Fault-Tolerant Multiprocessor Networks

  • Author

    Yanney, Raif M. ; Hayes, John P.

  • Author_Institution
    TRW
  • Issue
    10
  • fYear
    1986
  • Firstpage
    871
  • Lastpage
    879
  • Abstract
    A methodology for characterizing dynamic distributed recovery in fault-tolerant multiprocessor systems is developed using graph theory. Distributed recovery, which is intended for systems with no central supervisor, depends on the cooperation of a set of processors to execute the recovery function, since each processor is assumed to have only a limited amount of information about the system as a whole. Facility graphs, whose nodes denote the system components (processors), and whose edges denote interconnection between components, are used to represent multiprocessor systems, and error conditions. A general distributed recovery strategy R, which allows global recovery to be achieved via a sequence of local actions, is given. R recovers the system in several steps in which different nodes successively act as the local supervisor. R is specialized for two important classes of systems: loop networks and tree networks. For each of these cases, fault-tolerant designs and their associated distributed recovery strategies, which allow recovery from up to k faults within a specified number of steps, are presented.
  • Keywords
    Distributed recovery; fault tolerance; fault- tolerant multiprocessor systems; graph theory; loop networks; reconfiguration; tree networks; Communication networks; Computer errors; Computer networks; Degradation; Fault tolerance; Fault tolerant systems; Graph theory; Intelligent networks; Multiprocessing systems; Tree graphs; Distributed recovery; fault tolerance; fault- tolerant multiprocessor systems; graph theory; loop networks; reconfiguration; tree networks;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.1986.1676678
  • Filename
    1676678