DocumentCode :
3240062
Title :
STORM: Lightning-Fast Resource Management
Author :
Frachtenberg, Eitan ; Petrini, Fabrizio ; Fernandez, Juan ; Pakin, Scott ; Coll, Salvador
Author_Institution :
Los Alamos National Laboratory
fYear :
2002
fDate :
16-22 Nov. 2002
Firstpage :
46
Lastpage :
46
Abstract :
Although workstation clusters are a common platform for high-performance computing (HPC), they remain more difficult to manage than sequential systems or even symmetric multiprocessors. Furthermore, as cluster sizes increase, the quality of the resource-management subsystem — essentially, all of the code that runs on a cluster other than the applications — increasingly impacts application efficiency. In this paper, we present STORM, a resource-management framework designed for scalability and performance. The key innovation behind STORM is a software architecture that enables resource management to exploit low-level network features. As a result of this HPC-application-like design, STORM is orders of magnitude faster than the best reported results in the literature on two sample resource-management functions: job launching and process scheduling.
Keywords :
Application software; Clustering algorithms; Informatics; Job design; Laboratories; Resource management; Scalability; Storms; Usability; Workstations;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Supercomputing, ACM/IEEE 2002 Conference
ISSN :
1063-9535
Print_ISBN :
0-7695-1524-X
Type :
conf
DOI :
10.1109/SC.2002.10057
Filename :
1592882
Link To Document :
بازگشت