DocumentCode :
2050490
Title :
Remove the memory wall: from performance modeling to architecture optimization
Author :
Sun, Xian-He
Author_Institution :
Dept. of Comput. Sci., Illinois Inst. of Technol., Chicago, IL
fYear :
2006
fDate :
25-29 April 2006
Abstract :
Summary form only given. Data access is a known bottleneck of high performance computing (HPC). The prime sources of this bottleneck are the performance gap between the processor and memory storage and the large memory requirements of ever-hungry applications. Although advanced memory hierarchies and parallel file systems have been developed in recent years, they only provide high bandwidth for contiguous, well-formed data streams, performing poorly for accessing small, noncontiguous data. Unfortunately, many HPC applications make a large number of requests for small and noncontiguous pieces of data, as do high-level I/O libraries such as HDF-5. The problematic memory wall remains after years of study and, in fact, is becoming the most important issue of HPC. We propose a new I/O architecture for HPC. Unlike traditional I/O designs where data is stored and retrieved by request, our architecture is based on a novel "server-push" model in which a data access server proactively pushes data from a file server to the compute node\´s memory or to it\´s cache directly based on the architecture design. Simulation results show that with the new approach the cache hit rates increase well above 90% for various benchmark applications that are notorious for poor cache performance. Performance evaluation is the driven force of the push-based model. Mechanisms of performance modeling, evaluation, and optimization are applied to data access pattern identification, prefetching algorithm design, data replacement strategy development, and architecture optimization to enable the "server-push" model. Our current success illustrates the power and unique role of performance evaluation in computing
Keywords :
file servers; input-output programs; memory architecture; parallel processing; performance evaluation; HDF-5; I/O architecture; architecture optimization; data access pattern identification; data replacement strategy development; file server; high performance computing; high-level I/O library; memory storage; memory wall; parallel file system; performance evaluation; performance modeling; prefetching algorithm design; server-push model; Bandwidth; Computational modeling; Computer architecture; Design optimization; File servers; File systems; High performance computing; Information retrieval; Libraries; Prefetching;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium, 2006. IPDPS 2006. 20th International
Conference_Location :
Rhodes Island
Print_ISBN :
1-4244-0054-6
Type :
conf
DOI :
10.1109/IPDPS.2006.1639621
Filename :
1639621
Link To Document :
بازگشت