• DocumentCode
    1341085
  • Title

    Automated tuning of parallel I/O systems: an approach to portable I/O performance for scientific applications

  • Author

    Chen, Ying ; Winslett, Marianne

  • Author_Institution
    IBM Almaden Res. Center, San Jose, CA, USA
  • Volume
    26
  • Issue
    4
  • fYear
    2000
  • fDate
    4/1/2000 12:00:00 AM
  • Firstpage
    362
  • Lastpage
    383
  • Abstract
    Parallel I/O systems typically consist of individual processors, communication networks, and a large number of disks. Managing and utilizing these resources to meet performance, portability, and usability goals of high performance scientific applications has become a significant challenge. For scientists, the problem is exacerbated by the need to retune the I/O portion of their code for each supercomputer platform where they obtain access. We believe that a parallel I/O system that automatically selects efficient I/O plans for user applications is a solution to this problem. The authors present such an approach for scientific applications performing collective I/O requests on multidimensional arrays. Under our approach, an optimization engine in a parallel I/O system selects high quality I/O plans without human intervention, based on a description of the application I/O requests and the system configuration. To validate our hypothesis, we have built an optimizer that uses rule based and randomized search based algorithms to tune parameter settings in Panda, a parallel I/O library for multidimensional arrays. Our performance results obtained from an IBM SP using an out-of-core matrix multiplication application show that the Panda optimizer is able to select high quality I/O plans and deliver high performance under a variety of system configurations with a small total optimization overhead
  • Keywords
    bibliographies; input-output programs; matrix multiplication; natural sciences computing; parallel programming; program diagnostics; software libraries; software portability; IBM SP; Panda; automated tuning; collective I/O requests; high quality I/O plans; human intervention; multidimensional arrays; optimization engine; out-of-core matrix multiplication application; parallel I/O library; parallel I/O system; parallel I/O systems; parameter settings; parameter tuning; portable I/O performance; randomized search based algorithms; scientific applications; supercomputer platform; system configuration; system configurations; total optimization overhead; usability goals; user applications; Analytical models; Application software; Communication networks; Computer Society; Cost function; Database systems; Humans; Multidimensional systems; Resource management; Usability;
  • fLanguage
    English
  • Journal_Title
    Software Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0098-5589
  • Type

    jour

  • DOI
    10.1109/32.844494
  • Filename
    844494