• DocumentCode
    1989890
  • Title

    Fault-tolerant computing on cluster of workstations

  • Author

    Bataineh, S. ; Al-Karaki, J.

  • Author_Institution
    Dept. of Comput. Eng., Jordan Univ. of Sci. & Technol., Irbid, Jordan
  • fYear
    2003
  • fDate
    14-18 July 2003
  • Firstpage
    16
  • Abstract
    Summary form only given. Employing queuing theory, closed form solutions for the response time of fault tolerant clusters of workstations are obtained. Workstations (nodes) in the cluster are divided into two sets: active set and back up set. Fault tolerance is achieved by having a set of active nodes to replicate their services at a set of backup nodes. Active nodes periodically checkpoint their status on the backups. If an active node fails, one of the backups takes over and joins the active set. Two immediate repair mechanisms are considered to repair faulty nodes in the system. In addition to their closed form formats, the analytical results have several advantages over those presented in the previous work. Unlike previous work in this area, there is no need to solve a set of recursive equations and the results reveal much of the characteristics of the system.
  • Keywords
    fault tolerant computing; queueing theory; system recovery; workstation clusters; active node; active set; back up set; backup node; closed form format; fault-tolerant computing; faulty node; queuing theory; recursive equation; repair mechanism; response time; service replication; workstation cluster; Closed-form solution; Delay; Equations; Fault tolerance; Queueing analysis; Workstations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Systems and Applications, 2003. Book of Abstracts. ACS/IEEE International Conference on
  • Conference_Location
    Tunis, Tunisia
  • Print_ISBN
    0-7803-7983-7
  • Type

    conf

  • DOI
    10.1109/AICCSA.2003.1227453
  • Filename
    1227453