• DocumentCode
    2564010
  • Title

    Numerical techniques for assessing reliability and performance of gracefully degrading computer systems

  • Author

    Constantinescu, Cristian

  • Author_Institution
    Intel Corp., Hillsboro, OR, USA
  • fYear
    1997
  • fDate
    13-16 Jan 1997
  • Firstpage
    159
  • Lastpage
    163
  • Abstract
    Gracefully degrading systems represent a cost effective alternative to the massively redundant fault-tolerant computing systems. Assessing the effectiveness of these systems requires combined reliability and performance measures such as computational availability, performability and accumulated reward. This paper compares, for the first time, two numerical algorithms used for assessing the complementary distribution of the accumulated reward and the expected accumulated reward, respectively. Both methods are employed for analyzing a multiprocessor server. The first one, based on Laplace transforms, numerical evaluation of eigenvalues, and analytical and numerical inversion of the Laplace transforms, gives accurate results for low values of the accumulated reward. However, instability of the numerical inversion routine negatively affects the results when the accumulated reward approaches the maximum attainable performance of the system. The second method, which relies on randomization, proves to be insensitive to the performance level reached by the system. This approach is used to analyze the impact of the fault/error coverage probability, spare processing units, repair, and performance degradation on the expected accumulated reward of the server. We conclude that the randomization based method is a more accurate approach for assessing the reliability and performance of gracefully degrading systems
  • Keywords
    Laplace transforms; eigenvalues and eigenfunctions; fault tolerant computing; file servers; multiprocessing systems; numerical stability; redundancy; reliability; Laplace transforms; accumulated reward; complementary distribution; computational availability; eigenvalues; expected accumulated reward; fault/error coverage probability; gracefully degrading computer systems; massively redundant fault-tolerant computing systems; multiprocessor server; numerical algorithms; numerical inversion routine instability; numerical techniques; performability; performance assessment; performance degradation; randomization; reliability assessment; repair; spare processing units; Availability; Costs; Degradation; Distribution functions; Eigenvalues and eigenfunctions; Fault tolerant systems; Finite wordlength effects; Parallel machines; Performance analysis; Performance evaluation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Reliability and Maintainability Symposium. 1997 Proceedings, Annual
  • Conference_Location
    Philadelphia, PA
  • ISSN
    0149-144X
  • Print_ISBN
    0-7803-3783-2
  • Type

    conf

  • DOI
    10.1109/RAMS.1997.571696
  • Filename
    571696