• DocumentCode
    1090823
  • Title

    Dependability modeling and analysis of distributed programs

  • Author

    Lopez-Benitez, Noé

  • Author_Institution
    Dept. of Comput. Sci., Texas Tech. Univ., Lubbock, TX, USA
  • Volume
    20
  • Issue
    5
  • fYear
    1994
  • fDate
    5/1/1994 12:00:00 AM
  • Firstpage
    345
  • Lastpage
    352
  • Abstract
    Presents a modeling approach based on stochastic Petri nets to estimate the reliability and availability of programs in a distributed computing system environment. In this environment, successful execution of programs is conditioned on the successful access of related files distributed throughout the system. The use of stochastic Petri nets is demonstrated by extending a basic reliability model to account for repair actions when faults occur. To this end, two possible models are discussed: the global repair model, which assumes a centralized repair team that restores the system to its original status when a failure state is reached, and the local repair model, which assumes that repairs are localized to the node where they occur. The former model is useful in evaluating the availability of programs (or the availability of the hardware support) subject to hardware faults that are repaired globally; therefore, the programs of interest can be interrupted. On the other hand, the latter model can be used to evaluate program reliability in the presence of hardware faults subject to repair, without interrupting the normal operation of the system
  • Keywords
    Petri nets; distributed algorithms; multiprocessing programs; program diagnostics; programming theory; software reliability; stochastic processes; system recovery; centralized repair team; dependability analysis; dependability modeling; distributed computing system environment; distributed programs; failure state; file distribution; global repair mode; hardware faults; hardware support; local repair model; program availability; program execution; program interruption; program reliability; repair actions; stochastic Petri nets; system status restoration; Availability; Distributed computing; Distributed control; Graph theory; Hardware; Petri nets; Reliability theory; Stochastic processes; Stochastic systems; Tree graphs;
  • fLanguage
    English
  • Journal_Title
    Software Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0098-5589
  • Type

    jour

  • DOI
    10.1109/32.286421
  • Filename
    286421