Title :
On-line recovery for rediscovered software problems
Author :
Lee, Inhwan ; McRee, Randy ; Bartlett, Wendy
Author_Institution :
Tandem Comput. Inc., Cupertino, CA, USA
Abstract :
This paper discusses a method that can allow a system to avoid or recover from the exercise of certain known faults at runtime and thus make certain software upgrades unnecessary. The method uses the knowledge of the characteristic symptoms of a software fault and the appropriate recovery action for the fault to detect and recover from the future exercise of the fault. An analysis of field data shows that the method is applicable to about 25% of faults and there is a potential for this number to go up. Using the method can not only improve the availability of user applications, but can also reduce the number of rediscovered problems and hence reduce the resources required for software service
Keywords :
software performance evaluation; characteristic symptoms; field data; online recovery; rediscovered software problems; software fault; software service; software upgrades; Application software; Availability; Data analysis; Failure analysis; Fault detection; Kernel; Multiprocessing systems; Programming; Runtime; Software maintenance;
Conference_Titel :
Computer Performance and Dependability Symposium, 1996., Proceedings of IEEE International
Conference_Location :
Urbana-Champaign, IL
Print_ISBN :
0-8186-7484-9
DOI :
10.1109/IPDS.1996.540209