• DocumentCode
    1217273
  • Title

    Architectural support for designing fault-tolerant open distributed systems

  • Author

    Hariri, Salim ; Choudhary, Alok ; Sarikaya, Behcet

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Syracuse Univ., NY, USA
  • Volume
    25
  • Issue
    6
  • fYear
    1992
  • fDate
    6/1/1992 12:00:00 AM
  • Firstpage
    50
  • Lastpage
    62
  • Abstract
    An overview of the main techniques for designing fault-tolerant software and hardware systems is provided. The important features of the building blocks (computers, memories, buses, etc.) that can support an efficient implementation of fault-tolerant open distributed systems (FTODSs) are identified. Taking into account the features of these building blocks, an organization for FTODS is proposed. A distributed voting algorithm and a two-level hierarchy for permanent memory are key elements in this scheme. The algorithms needed for transferring files and synchronizing the concurrent activities of the computing modules and for recovery-are ISO standard protocols. Low-level voting and recovery algorithms that can run as a layer of software above the operating system make the open distributed system an attractive environment for applying fault-tolerant techniques.<>
  • Keywords
    distributed processing; fault tolerant computing; open systems; ISO standard protocols; distributed voting algorithm; fault-tolerant open distributed systems; operating system; permanent memory; two-level hierarchy; Concurrent computing; Distributed computing; Fault diagnosis; Fault tolerance; Fault tolerant systems; Hardware; ISO standards; Software design; Software systems; Voting;
  • fLanguage
    English
  • Journal_Title
    Computer
  • Publisher
    ieee
  • ISSN
    0018-9162
  • Type

    jour

  • DOI
    10.1109/2.153257
  • Filename
    153257