Title :
Exploring the future of out-of-core computing with compute-local non-volatile memory
Author :
Myoungsoo Jung ; Wilson, Ellis H. ; Wonil Choi ; Shalf, J. ; Aktulga, Hasan Metin ; Chao Yang ; Saule, Erik ; Catalyurek, Umit V. ; Kandemir, Mahmut
Author_Institution :
Dept. of Electr. Eng., Univ. of Texas at Dallas, Dallas, TX, USA
Abstract :
Drawing parallels to the rise of general purpose graphical processing units (GPGPUs) as accelerators for specific high-performance computing (HPC) workloads, there is a rise in the use of non-volatile memory (NVM) as accelerators for I/O-intensive scientific applications. However, existing works have explored use of NVM within dedicated I/O nodes, which are distant from the compute nodes that actually need such acceleration. As NVM bandwidth begins to out-pace point-to-point network capacity, we argue for the need to break from the archetype of completely separated storage. Therefore, in this work we investigate co-location of NVM and compute by varying I/O interfaces, file systems, types of NVM, and both current and future SSD architectures, uncovering numerous bottlenecks implicit in these various levels in the I/O stack. We present novel hardware and software solutions, including the new Unified File System (UFS), to enable fuller utilization of the new compute-local NVM storage. Our experimental evaluation, which employs a real-world Out-of-Core (OoC) HPC application, demonstrates throughput increases in excess of an order of magnitude over current approaches.
Keywords :
graphics processing units; multiprocessing systems; parallel processing; random-access storage; storage management; GPGPUs; HPC workloads; I/O interfaces; I/O-intensive scientific applications; NVM bandwidth; OoC HPC application; SSD architectures; UFS; accelerators; compute nodes; compute-local NVM storage; compute-local nonvolatile memory; dedicated I/O nodes; file systems; general purpose graphical processing units; high-performance computing; out-of-core computing; out-pace point-to-point network capacity; real-world out-of-core HPC application; unified file system; Abstracts; Computer architecture; Industries; Nonvolatile memory; Phase change materials; Prototypes; Random access memory;
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis (SC), 2013 International Conference for
Conference_Location :
Denver, CO
Print_ISBN :
978-1-4503-2378-9
DOI :
10.1145/2503210.2503261