DocumentCode
2792887
Title
A Flexible Resource Management Architecture for the Blue Gene/P Supercomputer
Author
Miller, Sam ; Megerian, Mark ; Allen, Paul ; Budnik, Tom
Author_Institution
IBM Syst. & Technol. Group, Rochester, MN
fYear
2007
fDate
26-30 March 2007
Firstpage
1
Lastpage
8
Abstract
Blue Genereg/P are a massively parallel supercomputer intended as the successor to Blue Gene/L. It leverages much of the existing architecture of its predecessor to provide scalability up to a petaflop of peak computing power. The resource management software for such a large parallel system faces several challenges, including system fragmentation due to partitioning, presenting resource usage information using a polling or event driven model, and acting as a barrier between external resource management systems and the Blue Gene/P core. This paper describes how the Blue Gene/P resource management architecture is extremely flexible by providing multiple methodologies for obtaining resource usage information to make scheduling decisions. Three distinctly separate resource management services can be described. First, the Bridge API, a full-featured API suitable for fine tuning scheduling and allocation decisions. Second, a light-weight allocator API for allocating resources without substantial development costs. And lastly, configuring the system into static partitions. Job scheduling strategies utilizing each of the methods can be discussed.
Keywords
application program interfaces; parallel machines; resource allocation; scheduling; API; Blue Gene/P; application program interface; job scheduling; parallel supercomputer; peak computing power; resource allocation; resource management architecture; Bridges; Computer architecture; Concurrent computing; Costs; Engines; Power system modeling; Processor scheduling; Resource management; Scalability; Supercomputers;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International
Conference_Location
Long Beach, CA
Print_ISBN
1-4244-0910-1
Electronic_ISBN
1-4244-0910-1
Type
conf
DOI
10.1109/IPDPS.2007.370628
Filename
4228356
Link To Document