DocumentCode :
1085011
Title :
Comparative modeling and evaluation of CC-NUMA and COMA on hierarchical ring architectures
Author :
Zhang, Xiaodong ; Yan, Yong
Author_Institution :
High Performance Comput. & Software Lab., Texas Univ., San Antonio, TX, USA
Volume :
6
Issue :
12
fYear :
1995
fDate :
12/1/1995 12:00:00 AM
Firstpage :
1316
Lastpage :
1331
Abstract :
Parallel computing performance on scalable shared-memory architectures is affected by the structure of the interconnection networks linking processors to memory modules and on the efficiency of the memory/cache management systems. Cache Coherence Nonuniform Memory Access (CC-NUMA) and Cache Only Memory Access (COMA) are two effective memory systems, and the hierarchical ring structure is an efficient interconnection network in hardware. This paper focuses on comparative performance modeling and evaluation of CC-NUMA and COMA on a hierarchical ring shared-memory architecture. Analytical models for the two memory systems for comparative evaluation are presented. Intensive performance measurements on data migrations have been conducted on the KSR-1, a COMA hierarchical ring shared-memory machine. Experimental results support the analytical models, and we present practical observations and comparisons of the two cache coherence memory systems. Our analytical and experimental results show that a COMA system balances the work load well. However the overhead of frequent data movement may match the gains obtained from improving load balance. We believe our performance results could be further generalized to the two memory systems on a hierarchical network architecture. Although a CC-NUMA system may not automatically balance the load at the system level, it provides an option for a user to explicitly handle data locality for a possible performance improvement
Keywords :
cache storage; multiprocessor interconnection networks; parallel architectures; performance evaluation; shared memory systems; CC-NUMA; COMA; Cache Coherence Nonuniform Memory Access; Cache Only Memory Access; KSR-1; cache management systems; comparative performance modeling; data migrations; frequent data movement; hierarchical ring architectures; hierarchical ring shared-memory architecture; hierarchical ring structure; interconnection network; interconnection networks; memory modules; memory systems; parallel computing performance; scalable shared-memory architectures; slotted rings; Analytical models; Coherence; Computer architecture; Computer network management; Hardware; Joining processes; Measurement; Memory management; Multiprocessor interconnection networks; Parallel processing;
fLanguage :
English
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/71.476171
Filename :
476171
Link To Document :
بازگشت