• DocumentCode
    1523281
  • Title
    Design and Evaluation of Multiple-Level Data Staging for Blue Gene Systems
  • Author
    Isaila, Florin ; Blas, Javier Garcia ; Carretero, Jesus ; Latham, Robert ; Ross, Robert
  • Author_Institution
    Univ. Carlos III of Madrid, Leganés, Spain
  • Volume
    22
  • Issue
    6
  • fYear
    2011
  • fDate
    6/1/2011
  • Firstpage
    946
  • Lastpage
    959
  • Abstract
    Parallel applications currently suffer from a significant imbalance between computational power and available I/O bandwidth. Additionally, the hierarchical organization of current Petascale systems contributes to an increase in I/O subsystem latency. In these hierarchies, file access involves pipelining data through several networks, each adding latency and raising the probability of congestion. Future Exascale systems are likely to share this trait. This paper presents a scalable parallel I/O software system designed to transparently hide the latency of file system accesses from applications on these platforms. Our solution takes advantage of the hierarchy of networks involved in file accesses to maximize the degree of overlap between computation, file I/O-related communication, and file system access. We describe and evaluate a two-level hierarchy for Blue Gene systems consisting of client-side and I/O node-side caching. Our file cache management modules coordinate the data staging between application and storage through the Blue Gene networks. The experimental results demonstrate that our architecture achieves significant performance improvements through a high degree of overlap between computation, communication, and file I/O.
  • Keywords
    cache storage; input-output programs; parallel processing; pipeline processing; Exascale systems; Petascale systems; Blue Gene systems; data pipelining; file I/O-related communication; file access; file cache management; file system access; multiple-level data staging design; multiple-level data staging evaluation; parallel I/O software system; parallel applications; Application software; Bandwidth; Computer networks; Concurrent computing; Delay; File systems; Petascale computing; Pipeline processing; Software design; Software systems; MPI-IO; parallel I/O; parallel file systems; supercomputers
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1045-9219
  • Type
    jour
  • DOI
    10.1109/TPDS.2010.127
  • Filename
    5492680