• DocumentCode
    1926599
  • Title

    Scalable I/O forwarding framework for high-performance computing systems

  • Author

    Ali, Nawab ; Carns, Philip ; Iskra, Kamil ; Kimpe, Dries ; Lang, Samuel ; Latham, Robert ; Ross, Robert ; Ward, Lee ; Sadayappan, P.

  • Author_Institution
    Ohio State Univ., Columbus, OH, USA
  • fYear
    2009
  • fDate
    Aug. 31 2009-Sept. 4 2009
  • Firstpage
    1
  • Lastpage
    10
  • Abstract
    Current leadership-class machines suffer from a significant imbalance between their computational power and their I/O bandwidth. While Moore´s law ensures that the computational power of high-performance computing systems increases with every generation, the same is not true for their I/O subsystems. The scalability challenges faced by existing parallel file systems with respect to the increasing number of clients, coupled with the minimalistic compute node kernels running on these machines, call for a new I/O paradigm to meet the requirements of data-intensive scientific applications. I/O forwarding is a technique that attempts to bridge the increasing performance and scalability gap between the compute and I/O components of leadership-class machines by shipping I/O calls from compute nodes to dedicated I/O nodes. The I/O nodes perform operations on behalf of the compute nodes and can reduce file system traffic by aggregating, rescheduling, and caching I/O requests. This paper presents an open, scalable I/O forwarding framework for high-performance computing systems. We describe an I/O protocol and API for shipping function calls from compute nodes to I/O nodes, and we present a quantitative analysis of the overhead associated with I/O forwarding.
  • Keywords
    application program interfaces; file organisation; parallel machines; scheduling; API; I/O request aggregation; I/O request caching; I/O request rescheduling; I/O subsystems; data-intensive scientific applications; high-performance computing systems; parallel file systems; scalable I/O forwarding framework; Bandwidth; Concurrent computing; File systems; Kernel; Laboratories; Libraries; Moore´s Law; Power generation; Scalability; Supercomputers; I/O forwarding; Leadership-class machines; Parallel file systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing and Workshops, 2009. CLUSTER '09. IEEE International Conference on
  • Conference_Location
    New Orleans, LA
  • ISSN
    1552-5244
  • Print_ISBN
    978-1-4244-5011-4
  • Electronic_ISBN
    1552-5244
  • Type

    conf

  • DOI
    10.1109/CLUSTR.2009.5289188
  • Filename
    5289188