Title :
The design and architecture of the Microsoft Cluster Service: a practical approach to high-availability and scalability
Author :
Vogels, Werner ; Dumitriu, Dan ; Birman, Ken ; Gamache, Rod ; Massa, Mike ; Short, Rob ; Vert, John ; Barrera, Joe ; Gray, Jim
Author_Institution :
Dept. of Comput. Sci., Cornell Univ., Ithaca, NY, USA
Abstract :
Microsoft Cluster Service (MSCS) extends the Windows NT operating system to support high-availability services. The goal is to offer an execution environment where off-the-shelf server applications can continue to operate, even in the presence of node failures. Later versions of MSCS will provide scalability via a node and application management system which allows applications to scale to hundreds of nodes. In this paper we provide a detailed description of the MSCS architecture and the design decisions that have driven the implementation of the service. The paper also describes how some major applications use the MSCS features, and describes features added to make it easier to implement and manage fault-tolerant applications on MSCS.
Keywords :
network operating systems; software fault tolerance; system recovery; Microsoft Cluster Service; Windows NT operating system; availability; fault-tolerant applications; high-availability services; node and application management system; node failures; off-the-shelf server applications; scalability; Algorithm design and analysis; Application software; Clustering algorithms; Computer architecture; Computer science; Contracts; Distributed computing; Operating systems; Power system management; Scalability;
Conference_Title :
Twenty-Eighth Annual International Symposium on Fault-Tolerant Computing, 1998. Digest of Papers.
Conference_Location :
Munich, Germany
Print_ISBN :
0-8186-8470-4
DOI :
10.1109/FTCS.1998.689494