• DocumentCode
    2258554
  • Title

    CCS resource management in networked HPC systems

  • Author

    Keller, Axel ; Reinefeld, Alexander

  • Author_Institution
    Paderborn Center for Parallel Comput., Paderborn Univ., Germany
  • fYear
    1998
  • fDate
    35884
  • Firstpage
    44
  • Lastpage
    56
  • Abstract
    CCS is a resource management system for parallel high-performance computers. At the user level, CCS provides vendor-independent access to parallel systems. At the system administrator level, CCS offers tools for controlling (i.e, specifying, configuring and scheduling) the system components that are operated in a computing center. Hence the name “Computing Center Software”. CCS provides: hardware-independent scheduling of interactive and batch jobs; partitioning of exclusive and non-exclusive resources; open, extensible interfaces to other resource management systems; a high degree of reliability (e.g. automatic restart of crashed daemons); fault tolerance in the case of network breakdowns. The authors describe CCS as one important component for the access, job distribution, and administration of networked HPC systems in a metacomputing environment
  • Keywords
    application program interfaces; batch processing (computers); fault tolerant computing; local area networks; parallel processing; processor scheduling; reliability; wide area networks; CCS resource management system; Computing Center Software; batch jobs; exclusive resource partitioning; fault tolerance; hardware-independent scheduling; interactive jobs; job distribution; metacomputing environment; network breakdowns; networked HPC systems; nonexclusive resource partitioning; open extensible interfaces; parallel high-performance computers; parallel systems; reliability; system administrator level; system component control; vendor-independent access; Carbon capture and storage; Computer networks; Distributed computing; Fault tolerance; Hardware; Intelligent networks; Metacomputing; Parallel processing; Processor scheduling; Resource management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Heterogeneous Computing Workshop, 1998. (HCW 98) Proceedings. 1998 Seventh
  • Conference_Location
    Orlando, FL
  • ISSN
    1097-5209
  • Print_ISBN
    0-8186-8365-1
  • Type

    conf

  • DOI
    10.1109/HCW.1998.666544
  • Filename
    666544