Title :
Pink: a 1024-node single-system image Linux cluster
Author :
Watson, Gregory R. ; Sottile, Matthew J. ; Minnich, Ronald G. ; Choi, Sung-Eun ; Hertdriks, E.A.
Author_Institution :
Adv. Comput. Lab., Los Alamos Nat. Lab., NM, USA
Abstract :
This work describes our experience of designing and building Pink, a 1024-node (2048 processor) Myrinet-based single-system image Linux cluster that was installed in January 2003 at the Los Alamos National Laboratory. At the time of its installation, Pink was the largest single-system image Linux cluster in the world, and was based entirely on open-source software - from the BIOS up. Pink was the proof-of-concept prototype for Lightning, a production 1408-node (2816 processor) cluster that begin operation at LANL. Lightning is currently number 6 on the Top500 list. In This work we examine the issues that were encountered and the problems that needed to be overcome in order to scale a cluster to this size. We also present some performance numbers that demonstrate the scalability and manageability of the cluster software suite.
Keywords :
computer communications software; network computers; open systems; workstation clusters; 1024-node single-system image Linux cluster; 2048 processor; Lightning; Los Alamos National Laboratory; Myrinet-based single-system image Linux cluster; Pink; cluster software suite; open-source software; proof-of-concept prototype; Buildings; Laboratories; Lightning; Linux; Open source software; Production; Prototypes; Scalability; Software performance; Software prototyping;
Conference_Titel :
High Performance Computing and Grid in Asia Pacific Region, 2004. Proceedings. Seventh International Conference on
Print_ISBN :
0-7695-2138-X
DOI :
10.1109/HPCASIA.2004.1324076