Title :
PRISM: an integrated architecture for scalable shared memory
Author :
Ekanadham, Kattamuri ; Lim, Beng-Hong ; Pattnaik, Pratap ; Snir, Marc
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
This paper describes PRISM, a distributed shared memory architecture that relies on a tightly integrated hardware and operating system design for scalable and reliable performance. PRISM´s hardware provides mechanisms for flexible management and dynamic configuration of shared memory pages with different behaviors. As an example, PRISM can configure individual shared memory pages in both CC-NUMA and Simple-COMA styles, maintaining the advantages of both without incorporating any of their disadvantages. PRISM´s operating system is structured as multiple independent kernels, where each kernel manages the resources on its local node. PRISM´s system structure minimizes the amount of global coordination when managing shared memory. Page faults do not involve global TLB invalidates, and pages can be replicated and migrated without requiring global coordination. The structure also provides natural fault containment boundaries around each node because physical addresses do not address remote memory directly. We simulate PRISM´s hardware, cache coherence protocol and memory management algorithms. Results from SPLASH applications on the simulated machine demonstrate a tradeoff between CC-NUMA and Simple-COMA styles of memory management. Adaptive, run-time policies that take advantage of PRISM´s ability to dynamically configure shared memory pages with different behaviors significantly outperform pure CC-NUMA or Simple-COMA configurations and are usually within 10% of optimal performance
Keywords :
cache storage; distributed memory systems; memory architecture; memory protocols; operating system kernels; paged storage; parallel architectures; performance evaluation; reconfigurable architectures; shared memory systems; CC-NUMA; PRISM; SPLASH applications; Simple-COMA; adaptive run-time policies; cache coherence protocol; distributed shared memory architecture; fault containment boundaries; hardware; memory management algorithms; multiple independent kernels; operating system design; page faults; page replication; reliable performance; scalable shared memory; shared memory page management; Computer architecture; Control systems; Electronic switching systems; Memory architecture; Memory management; Operating systems; Read only memory; Resource management; Runtime; Scalability;
Conference_Titel :
High-Performance Computer Architecture, 1998. Proceedings., 1998 Fourth International Symposium on
Conference_Location :
Las Vegas, NV
Print_ISBN :
0-8186-8323-6
DOI :
10.1109/HPCA.1998.650554