• DocumentCode
    3317266
  • Title

    Pink: a 1024-node single-system image Linux cluster

  • Author

    Watson, Gregory R. ; Sottile, Matthew J. ; Minnich, Ronald G. ; Choi, Sung-Eun ; Hertdriks, E.A.

  • Author_Institution
    Adv. Comput. Lab., Los Alamos Nat. Lab., NM, USA
  • fYear
    2004
  • fDate
    20-22 July 2004
  • Firstpage
    454
  • Lastpage
    461
  • Abstract
    This work describes our experience of designing and building Pink, a 1024-node (2048 processor) Myrinet-based single-system image Linux cluster that was installed in January 2003 at the Los Alamos National Laboratory. At the time of its installation, Pink was the largest single-system image Linux cluster in the world, and was based entirely on open-source software - from the BIOS up. Pink was the proof-of-concept prototype for Lightning, a production 1408-node (2816 processor) cluster that begin operation at LANL. Lightning is currently number 6 on the Top500 list. In This work we examine the issues that were encountered and the problems that needed to be overcome in order to scale a cluster to this size. We also present some performance numbers that demonstrate the scalability and manageability of the cluster software suite.
  • Keywords
    computer communications software; network computers; open systems; workstation clusters; 1024-node single-system image Linux cluster; 2048 processor; Lightning; Los Alamos National Laboratory; Myrinet-based single-system image Linux cluster; Pink; cluster software suite; open-source software; proof-of-concept prototype; Buildings; Laboratories; Lightning; Linux; Open source software; Production; Prototypes; Scalability; Software performance; Software prototyping;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing and Grid in Asia Pacific Region, 2004. Proceedings. Seventh International Conference on
  • Print_ISBN
    0-7695-2138-X
  • Type

    conf

  • DOI
    10.1109/HPCASIA.2004.1324076
  • Filename
    1324076