Title :
Scalable Resource Management System for High Productive Computing
Author :
Lu, Yutong ; Xiao, Nong ; Yang, Xuejun
Author_Institution :
Sch. of Comput. Sci. & Technol., Nat. Univ. of defense Technol., Changsha
Abstract :
High performance computing is focused on providing high productivity computing system (HPCS), instead of seeking high performance only. HPCS needs more scalable and powerful resource management system. This paper proposes scalable hierarchy resource management architecture with cascade services to support the scalability of HPCS. We design a method of dynamic self-organization services configuration, optimize communication protocol for system management, and construct virtual topology tree to reduce the overhead of resource management system and quicken the large scale parallel job loading. A scalable resource management system (SRMS) have been implemented, and some experiments have been done to evaluate the scalability of SRMS.
Keywords :
parallel processing; resource allocation; cascade service; dynamic self-organization service; high performance computing; high productivity computing system; optimize communication protocol; parallel job loading; parallel process system; resource management architecture; scalable resource management system; system management; virtual topology tree; Computer architecture; Design methodology; Design optimization; High performance computing; Job design; Power system management; Productivity; Protocols; Resource management; Scalability; Cascade Service; High Productive Computing; Job loading; Resource Management; Scalable;
Conference_Titel :
ChinaGrid Annual Conference, 2008. ChinaGrid '08. The Third
Conference_Location :
Dunhuang, Gansu
Print_ISBN :
978-0-7695-3306-3
DOI :
10.1109/ChinaGrid.2008.39