DocumentCode
2258554
Title
CCS resource management in networked HPC systems
Author
Keller, Axel ; Reinefeld, Alexander
Author_Institution
Paderborn Center for Parallel Comput., Paderborn Univ., Germany
fYear
1998
fDate
1998-03-30
Firstpage
44
Lastpage
56
Abstract
CCS is a resource management system for parallel high-performance computers. At the user level, CCS provides vendor-independent access to parallel systems. At the system administrator level, CCS offers tools for controlling (i.e., specifying, configuring and scheduling) the system components that are operated in a computing center. Hence the name “Computing Center Software”. CCS provides: hardware-independent scheduling of interactive and batch jobs; partitioning of exclusive and non-exclusive resources; open, extensible interfaces to other resource management systems; a high degree of reliability (e.g. automatic restart of crashed daemons); fault tolerance in the case of network breakdowns. The authors describe CCS as one important component for the access, job distribution, and administration of networked HPC systems in a metacomputing environment.
Keywords
application program interfaces; batch processing (computers); fault tolerant computing; local area networks; parallel processing; processor scheduling; reliability; wide area networks; CCS resource management system; Computing Center Software; batch jobs; exclusive resource partitioning; fault tolerance; hardware-independent scheduling; interactive jobs; job distribution; metacomputing environment; network breakdowns; networked HPC systems; nonexclusive resource partitioning; open extensible interfaces; parallel high-performance computers; parallel systems; reliability; system administrator level; system component control; vendor-independent access; Computer networks; Distributed computing; Fault tolerance; Hardware; Intelligent networks; Metacomputing; Parallel processing; Processor scheduling; Resource management
fLanguage
English
Publisher
IEEE
Conference_Titel
Heterogeneous Computing Workshop, 1998. (HCW 98) Proceedings. 1998 Seventh
Conference_Location
Orlando, FL
ISSN
1097-5209
Print_ISBN
0-8186-8365-1
Type
conf
DOI
10.1109/HCW.1998.666544
Filename
666544