Title :
Parallel file system testing for the lunatic fringe: the care and feeding of restless I/O power users
Author :
Hedges, Richard ; Loewe, Bill ; McLarty, Tyce ; Morrone, Chris
Author_Institution :
Scalable I/O Project, Lawrence Livermore Nat. Lab., Berkeley, CA, USA
Abstract :
Over the last several years there has been a major thrust at the Lawrence Livermore National Laboratory toward building extremely large scale computing clusters based on open source software and commodity hardware. On the storage front, our efforts have focused upon the development of the Lustre file system and bringing it into production in our computer center. Given our customers´ requirements, it is assured that we will be living on the bleeding edge with this file system software as we press it into production. A further reality is that our partners are not able to duplicate the scale of systems as required for these testing purposes. For these practical reasons, the onus for file system testing at scale has fallen largely upon us. As an integral part of our testing efforts, we have developed programs for stress and performance testing of parallel file systems. This paper focuses on these unique test programs and upon how we apply them to understand the usage and failure modes of such large-scale parallel file systems.
Keywords :
network operating systems; program testing; storage management; I/O power users; Lawrence Livermore National Laboratory; Lustre file system; commodity hardware; large scale computing clusters; lunatic fringe; open source software; parallel file system testing; performance testing; stress testing; File systems; Hemorrhaging; Laboratories; Large-scale systems; Open source hardware; Open source software; Production systems; Software systems; Stress; System testing;
Conference_Titel :
Mass Storage Systems and Technologies, 2005. Proceedings. 22nd IEEE / 13th NASA Goddard Conference on
Print_ISBN :
0-7695-2318-8
DOI :
10.1109/MSST.2005.22