  • DocumentCode
    3291995
  • Title
    Recent Successes and Changes of the HPCMP Sustained Systems Performance Test
  • Author
    Bennett, Paul M. ; Brown, L.L.
  • Author_Institution
    Eng. R&D Center, Comput. Sci. & Eng. Group, US Army, Vicksburg, MS, USA
  • fYear
    2010
  • fDate
    14-17 June 2010
  • Firstpage
    453
  • Lastpage
    462
  • Abstract
    The sustained systems performance (SSP) test has been implemented on certain High Performance Computing Modernization Program (HPCMP) HPC systems in order to quantitatively evaluate updates to system software, hardware repairs, job queuing policy modifications, and revisions to the job scheduler as necessary. The test employs codes used in the system acquisition cycle with proven migration capability to HPCMP HPC systems and non-empirical tests for numerical accuracy. Metrics such as compilation time, queue wait time, benchmark execution time, and total test throughput time are gathered and compared against metric data from previous tests to monitor the systems under test while minimizing impact to the users. Jobs failing to execute properly or in anomalously short or long times are investigated, and the results are reported to system administrators and center directors at each center for appropriate actions. During the past year, the SSP test has been instrumental in surfacing configuration issues with the PBS scheduler and performance issues on several HPC systems. Additionally, the frequency of the SSP test on systems procured in Technology Insertion 2009 (TI-09) and thereafter has increased, with attendant changes in the test cases comprising the test. The SSP test continues to play an important role in monitoring the quality of service delivering HPC to HPCMP users at the system, DoD Supercomputing Resource Center, and vendor levels.
  • Keywords
    mainframes; military computing; performance evaluation; processor scheduling; program testing; systems analysis; systems software; DoD supercomputing resource center; HPC system; HPCMP sustained systems performance test; hardware repairs; high performance computing modernization program; job queuing policy modification; job scheduler; system acquisition cycle; system administrator; system software updates; Benchmark testing; Computational modeling; Libraries; Memory management; Oceans; Throughput; US Department of Defense;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    High Performance Computing Modernization Program Users Group Conference (HPCMP-UGC), 2010 DoD
  • Conference_Location
    Schaumburg, IL
  • Print_ISBN
    978-1-61284-986-7
  • Type
    conf
  • DOI
    10.1109/HPCMP-UGC.2010.46
  • Filename
    6018026