DocumentCode :
3412981
Title :
STL: a tool for on-line software update and rejuvenation
Author :
Yajnik, Shalini ; Huang, Yennun
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
fYear :
1997
fDate :
2-5 Nov 1997
Firstpage :
258
Abstract :
Summary form only given, as follows. A large number of tools and techniques have been developed in the past to achieve 24×7 system availability (24 hours a day and 7 days a week) by reducing unscheduled system downtime due to failures. However, a highly available or fault-tolerant system may still have to be taken off-line for software and hardware updates, maintenance and rejuvenation. Therefore, the scheduled downtime for maintenance could become the major source of system unavailability. One big challenge in a highly available system is to keep the system running while it is undergoing software updates or bug fixes. In this paper, we describe a tool that can be used to perform an online update of software in a cluster environment. The tool consists of a protocol compiler (stgen) and a library (libst) for marshaling and unmarshaling data during a software update. The tool has the ability to transfer complex data structures between two processes even if the data definitions in the two processes are different. The data transfer format is machine-independent. Hence, the tool can transfer data across processes running on different machine types. The paper describes some real-life applications of the tool and presents performance measurements of the tool for these applications.
Keywords :
computer aided software engineering; data structures; online operation; program compilers; protocols; software fault tolerance; software libraries; software maintenance; software performance evaluation; software tools; STL software tool; Standard Tool Library; bug fixes; call processing; cluster environment; complex data structure transfer; data definitions; data marshaling; fail-over; fault-tolerant system; hardware maintenance; hardware updates; libst library; machine-independent data transfer format; online software updating; performance measurements; software maintenance; software rejuvenation; stgen protocol compiler; system availability; system failures; unscheduled system downtime; Application software; Data structures; Fault tolerant systems; Hardware; Protocols; Software debugging; Software libraries; Software maintenance; Software performance; Software tools;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Software Reliability Engineering, 1997. Proceedings., The Eighth International Symposium on
Conference_Location :
Albuquerque, NM
Print_ISBN :
0-8186-8120-9
Type :
conf
DOI :
10.1109/ISSRE.1997.630872
Filename :
630872