• DocumentCode
    62481
  • Title

    Mission Reliability, Cost and Time for Cold Standby Computing Systems with Periodic Backup

  • Author

    Levitin, Gregory ; Liudong Xing ; Johnson, Barry W. ; Yuanshun Dai

  • Author_Institution
    Collaborative Autonomic Comput. Lab., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
  • Volume
    64
  • Issue
    4
  • fYear
    2015
  • fDate
    April 1 2015
  • Firstpage
    1043
  • Lastpage
    1057
  • Abstract
    Life critical applications like space missions and flight controls require their computing systems to be equipped with some fault-tolerance mechanism to meet stringent reliability requirements by performing the intended function even in the case of element failures. Such benefit, however, cannot come without extra time as well as extra overhead and capital costs. This paper for the first time considers the modeling and evaluation of mission reliability, expected mission time and cost simultaneously for 1-out-of-N: G nonrepairable cold standby computing systems subject to periodic backup actions. Based on the suggested numerical evaluation method, the optimal backup frequency problems are formulated and solved, providing the optimal number of backup operations during the mission to maximize the system reliability or to minimize the mission cost or time. In the case of non-identical system elements, the optimal standby element sequencing problem arises as the order in which the system elements are initiated can impact the system reliability and mission cost and time greatly; such problems are formulated and solved for the 1-out-of-N: G cold standby computing systems with periodic backups. Furthermore, a combined optimization problem is considered, where a combination of the element initiation sequence and backup frequency providing the best combination of mission reliability, cost, and time is found. The proposed methodology can facilitate a reliability-cost-time tradeoff study in the practical design of cold standby systems, thus assist in making the optimal decision on the system´s standby and backup policy. Examples are provided for illustrating the considered problems and suggested solution methodology.
  • Keywords
    aerospace computing; air traffic control; fault tolerant computing; optimisation; reliability; cold standby computing systems; combined optimization problem; fault-tolerance mechanism; mission reliability; numerical evaluation method; optimal backup frequency problems; periodic backup; reliability-cost-time tradeoff; Electronic mail; Genetic algorithms; Optimization; Reliability engineering; Switches; Time-frequency analysis; Cold standby; mission cost; mission time; optimization; periodic backup; reliability;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2014.2315644
  • Filename
    6782718