• DocumentCode
    2932742
  • Title

    Modeling Distributed Computing System Reliability with DRBD

  • Author

    Distefano, Salvatore ; Scarpa, Marco ; Puliafito, Antonio

  • Author_Institution
    Eng. Fac., Messina Univ.
  • fYear
    2006
  • fDate
    2-4 Oct. 2006
  • Firstpage
    106
  • Lastpage
    118
  • Abstract
    Nowadays the great part of devices or systems we commonly use are often driven or managed by microchips and computers: cars, music players, phones, trains, planes, .... A consolidated trend of technology is to substitute mechanical with electronic parts, analogical with digital devices or controls, and so on. In this context, features like security, availability and reliability, usually summarized under the concept of dependability, are receiving higher attention. The dependability analysis, especially for what regards critical parts as computing systems or subsystems, is becoming more strategic: specific requirements and explicit or tighter constraints have to be satisfied. Even though this fact, there is a lack of suitable tools to properly model and analyze these aspects, with particular reference to reliability. To fill this gap, we propose the dynamic reliability block diagram (DRBD) modeling tool derived from the reliability block diagram (RBD) formalism. The DRBD permits to model the dynamic reliability behavior of a system through dependence models, exploited to represent dynamics behaviors as redundancy, load sharing, multiple, probabilistic and common failure mode. In this paper, the DRBD expressiveness and other capabilities, are illustrated through the analysis of a complex distributed computing system taken as example
  • Keywords
    distributed processing; software reliability; DRBD modeling tool; dependence models; distributed computing system reliability modeling; dynamic reliability block diagram; dynamic reliability system behavior; reliability block diagram formalism; system dependability analysis; Availability; Digital control; Distributed computing; Engineering management; Fault trees; Neodymium; Redundancy; Reliability engineering; Resource management; Roentgenium;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Reliable Distributed Systems, 2006. SRDS '06. 25th IEEE Symposium on
  • Conference_Location
    Leeds
  • ISSN
    1060-9857
  • Print_ISBN
    0-7695-2677-2
  • Type

    conf

  • DOI
    10.1109/SRDS.2006.32
  • Filename
    4032473