Title :
Restart services for highly available systems
Author :
Bowen, Nicholas S. ; Polyzois, Christos A. ; Regan, Richard D.
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
This paper proposes a design methodology for building highly available systems. In addition, we describe a set of operating system services that can be used to achieve this goal. The techniques described are intended for a parallel environment and can be generalized for any distributed system. We describe a methodology for providing basic services for high availability, specific services for restart, and an implementation of these services.
Keywords :
fault tolerant computing; multiprocessing systems; operating systems (computers); system recovery; distributed system; highly available systems; operating system services; parallel environment; restart; Availability; Buildings; Concurrent computing; Design methodology; Environmental economics; Hardware; Humans; Information technology; Operating systems; Workstations;
Conference_Title :
Parallel and Distributed Processing, 1995. Proceedings. Seventh IEEE Symposium on
Conference_Location :
San Antonio, TX
Print_ISBN :
0-8186-7195-5
DOI :
10.1109/SPDP.1995.530737