Author :
Röblitz, Thomas ; Schintke, Florian ; Reinefeld, Alexander
Abstract :
Clusters provide an outstanding cost/performance ratio, but their efficient orchestration, i.e. their cooperative management, maintenance, and use, still poses difficulties. Moreover, many sites operate multiple clusters, each possible running under a different cluster management system. In this paper, we present an architectural scheme for the coordinated management of multiple clusters in a fabric. Our scheme allows different cluster management systems to interact with each other via adaptors, thereby providing interoperability within a single administrative entity, the fabric. Using adaptors, various jobs (grid jobs, local jobs, system maintenance jobs) can be served by the same methods, and existing cluster management software (like LSF, PBS, CCS, etc.) can be extended by additional functions without much modification effort.
Keywords :
computer communications software; computer network management; network operating systems; workstation clusters; adaptors; cluster management system; cooperative management; coordinated management; cost-performance ratio; fabric; grid jobs; interoperability; job management perspective; local jobs; multiple clusters; system maintenance jobs; Carbon capture and storage; Communication system software; Computer network management; Computer networks; Conference management; Costs; Fabrics; Humans; Large-scale systems; Network operating systems; Project management; Scheduling; Software maintenance;