DocumentCode :
1175840
Title :
A two-level directory architecture for highly scalable cc-NUMA multiprocessors
Author :
Acacio, Manuel E. ; González, José ; Garcia, José M. ; Duato, José
Author_Institution :
Dept. de Ingenieria y Tecnologia de Computadores, Murcia Univ., Spain
Volume :
16
Issue :
1
fYear :
2005
Firstpage :
67
Lastpage :
79
Abstract :
One important issue the designer of a scalable shared-memory multiprocessor must deal with is the amount of extra memory required to store the directory information. It is desirable that the directory memory overhead be kept as low as possible, and that it scales very slowly with the size of the machine. Unfortunately, current directory architectures provide scalability at the expense of performance. This work presents a scalable directory architecture that significantly reduces the size of the directory for large-scale configurations of a multiprocessor without degrading performance. First, we propose multilayer clustering as an effective approach to reduce the width of directory entries. Based on this concept, we derive three new compressed sharing codes, some of them with a space complexity of O(log_2(log_2 N)) for an N-node system. Then, we present a novel two-level directory architecture to eliminate the penalty caused by compressed directories in general. The proposed organization consists of a small full-map first-level directory (which provides precise information for the most recently referenced lines) and a compressed second-level directory (which provides in-excess information for all the lines). The proposals are evaluated based on extensive execution-driven simulations (using RSIM) of a 64-node cc-NUMA multiprocessor. Results demonstrate that a system with a two-level directory architecture achieves the same performance as a multiprocessor with a big and nonscalable full-map directory, with a very significant reduction of the memory overhead.
Keywords :
computational complexity; memory architecture; multiprocessor interconnection networks; parallel architectures; shared memory systems; N-node system; cc-NUMA multiprocessor; compressed sharing codes; execution-driven simulation; full-map first-level directory; multilayer clustering; scalable directory architecture; scalable shared-memory multiprocessor; space complexity; two-level directory architecture; Bandwidth; Broadcasting; Computer Society; Computer architecture; Degradation; Large-scale systems; Nonhomogeneous media; Proposals; Protocols; Scalability; directory memory overhead; unnecessary coherence messages
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
IEEE
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/TPDS.2005.4
Filename :
1363753