DocumentCode
3410673
Title
Adaptive granularity: transparent integration of fine- and coarse-grain communication
Author
Park, Daeyeon ; Saavedra, Rafael H.
Author_Institution
Dept. of Comput. Sci., Univ. of Southern California, Los Angeles, CA, USA
fYear
1996
fDate
35339
Firstpage
260
Lastpage
268
Abstract
The granularity of shared data is one of the key factors affecting the performance of distributed shared memory (DSM) machines. Given that programs exhibit quite different sharing patterns, providing only one or two fixed granularities cannot result in an efficient use of resources. On the other hand, supporting arbitrary granularity sizes significantly increases not only hardware complexity but software overhead as well. Furthermore, the efficient use of arbitrary granularities puts the burden on users to provide information about program behavior to compilers and/or runtime systems. These kinds of requirements tend to restrict the programmability of the shared memory model. In this paper, we present a new communication scheme, called adaptive granularity (AG). Adaptive granularity makes it possible to transparently integrate bulk transfer into the shared memory model by supporting variable-size granularity and memory replication. It consists of two protocols: one for small data and another for large data. For small data, the standard hardware DSM protocol is used and the granularity is fixed to the size of a cache line. For large array data, the protocol for bulk data is used instead and the granularity varies depending on the sharing behavior of applications at runtime. Simulation results show that AG improves performance by up to 43% over the hardware implementation of DSM (e.g., DASH, Alewife). Compared with an equivalent architecture that supports fine-grain memory replication at the fixed granularity of a cache line (e.g., Typhoon), AG reduces execution time by up to 35%.
Keywords
computational complexity; distributed memory systems; parallel architectures; performance evaluation; protocols; shared memory systems; adaptive granularity; coarse-grain communication; communication scheme; compilers; distributed shared memory machines; fine-grain communication; granularity sizes; hardware complexity; memory replication; performance; program behavior; protocols; runtime systems; sharing behavior; simulation results; software overhead; transparent integration; Computer science; Dynamic programming; Hardware; Parallel programming; Program processors; Protocols; Read-write memory; Runtime; Typhoons; Uniform resource locators;
fLanguage
English
Publisher
IEEE
Conference_Titel
Parallel Architectures and Compilation Techniques, 1996., Proceedings of the 1996 Conference on
Conference_Location
Boston, MA
ISSN
1089-795X
Print_ISBN
0-8186-7633-7
Type
conf
DOI
10.1109/PACT.1996.554034
Filename
554034
Link To Document