DocumentCode
1332588
Title
Impact of CC-NUMA memory management policies on the application performance of multistage switching networks
Author
Bhuyan, Laxmi N. ; Iyer, Ravi ; Wang, Hu-jun ; Kumar, Akhilesh
Author_Institution
Dept. of Comput. Sci., Texas A&M Univ., College Station, TX, USA
Volume
11
Issue
3
fYear
2000
fDate
3/1/2000 12:00:00 AM
Firstpage
230
Lastpage
246
Abstract
In this paper, the impact of memory management policies and switch design alternatives on the application performance of cache-coherent nonuniform memory access (CC-NUMA) multiprocessors is studied in detail. Memory management plays an important role in determining the performance of NUMA multiprocessors by dictating the placement of data among the distributed memory modules. We analyze memory traces of several scientific applications for three different memory management techniques, namely buddy, round-robin, and first-touch policies, and compare their memory system performance. Interconnection network switch designs that consider virtual channels and varying number of input buffers per switch are presented. Our performance evaluation is based on execution-driven simulation methodology to capture the dynamic changes in the network traffic during execution of the applications. It is shown that the use of cut-through switching with buffers and virtual channels can Improve the average message latency tremendously. However, the choice of memory management policy affects the amount of network traffic and the network access pattern. Thus, we vary the memory management policy and confirm the performance benefits of improved switch designs. Results of sensitivity studies by varying switch design parameters, cache block size, and memory page size are also presented. We find that a combination of first-touch memory management policy and a switch design with virtual channels and increased buffer space can reduce the average message latency by as high as 70 percent
Keywords
multistage interconnection networks; shared memory systems; storage management; CC-NUMA memory management; buddy; cache-coherent; first-touch; memory management; memory system performance; multistage switching networks; round-robin; Application software; Computer aided manufacturing; Delay; File servers; Memory management; Multiprocessor interconnection networks; Switches; System performance; Telecommunication traffic; Traffic control;
fLanguage
English
Journal_Title
Parallel and Distributed Systems, IEEE Transactions on
Publisher
ieee
ISSN
1045-9219
Type
jour
DOI
10.1109/71.841740
Filename
841740
Link To Document