PRISM: an integrated architecture for scalable shared memory

Author

Ekanadham, Kattamuri ; Lim, Beng-Hong ; Pattnaik, Pratap ; Snir, Marc

Author_Institution

IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA

fYear

1998

fDate

1-4 Feb 1998

Firstpage

140

Lastpage

151

Abstract

This paper describes PRISM, a distributed shared memory architecture that relies on a tightly integrated hardware and operating system design for scalable and reliable performance. PRISM´s hardware provides mechanisms for flexible management and dynamic configuration of shared memory pages with different behaviors. As an example, PRISM can configure individual shared memory pages in both CC-NUMA and Simple-COMA styles, maintaining the advantages of both without incorporating any of their disadvantages. PRISM´s operating system is structured as multiple independent kernels, where each kernel manages the resources on its local node. PRISM´s system structure minimizes the amount of global coordination when managing shared memory. Page faults do not involve global TLB invalidates, and pages can be replicated and migrated without requiring global coordination. The structure also provides natural fault containment boundaries around each node because physical addresses do not address remote memory directly. We simulate PRISM´s hardware, cache coherence protocol and memory management algorithms. Results from SPLASH applications on the simulated machine demonstrate a tradeoff between CC-NUMA and Simple-COMA styles of memory management. Adaptive, run-time policies that take advantage of PRISM´s ability to dynamically configure shared memory pages with different behaviors significantly outperform pure CC-NUMA or Simple-COMA configurations and are usually within 10% of optimal performance

Keywords

cache storage; distributed memory systems; memory architecture; memory protocols; operating system kernels; paged storage; parallel architectures; performance evaluation; reconfigurable architectures; shared memory systems; CC-NUMA; PRISM; SPLASH applications; Simple-COMA; adaptive run-time policies; cache coherence protocol; distributed shared memory architecture; fault containment boundaries; hardware; memory management algorithms; multiple independent kernels; operating system design; page faults; page replication; reliable performance; scalable shared memory; shared memory page management; Computer architecture; Control systems; Electronic switching systems; Memory architecture; Memory management; Operating systems; Read only memory; Resource management; Runtime; Scalability;

fLanguage

English

Publisher

ieee

Conference_Titel

High-Performance Computer Architecture, 1998. Proceedings., 1998 Fourth International Symposium on

Conference_Location

Las Vegas, NV

Print_ISBN

0-8186-8323-6

Type

conf

DOI

10.1109/HPCA.1998.650554

Filename

650554