Title :
System support for many task computing
Author :
Van Hensbergen, Eric ; Minnich, Ron
Author_Institution :
IBM Res., Austin, TX
Abstract :
The popularity of large-scale systems such as Blue Gene has extended their reach beyond HPC into the realm of commercial computing. There is a desire in both communities to broaden the scope of these machines from tightly coupled scientific applications running on MPI frameworks to more general-purpose workloads. Our approach deals with issues of scale by leveraging the huge number of nodes to distribute operating system services and components across the machine, tightly coupling the operating system and the interconnects to take maximum advantage of the unique capabilities of the HPC system. We plan to provision nodes to provide workload execution, aggregation, and system services, and to dynamically re-provision nodes as necessary to accommodate changes, failure, and redundancy. By incorporating aggregation as a first-class system construct, we will provide dynamic hierarchical organization and management of all system resources. In this paper, we describe the design principles of our approach, using file systems, workload distribution, and system monitoring as illustrative examples. Our end goal is to provide a cohesive distributed system that can broaden the class of applications for large-scale systems and also make them more approachable for a larger class of developers and end users.
Keywords :
application program interfaces; file organisation; message passing; operating systems (computers); Blue Gene; MPI frameworks; distribute operating systems services; file systems; large scale systems; many task computing; system monitoring; workload distribution; Application software; Computer networks; Ethernet networks; File systems; Large-scale systems; Monitoring; Network topology; Operating systems; Resource management; Scalability;
Conference_Title :
Many-Task Computing on Grids and Supercomputers, 2008. MTAGS 2008. Workshop on
Conference_Location :
Austin, TX
Print_ISBN :
978-1-4244-2872-4
DOI :
10.1109/MTAGS.2008.4777907