Trading bandwidth for latency: managing continuations through a carpet bag cache

Author

Murphy, Richard C. ; Kogge, Peter M.

Author_Institution

Dept. of Comput. Sci. & Eng., Notre Dame Univ., USA

fYear

2002

fDate

2002

Firstpage

41

Lastpage

49

Abstract

Processing-in-memory (PIM) circumvents the von Neumann bottleneck by combining logic and memory (typically DRAM) on a single die. This work examines the performance of a mobile thread execution model in which threads traverse the system\´s address space in search of their required data over a massively parallel PIM array targeted at petaFLOP performance. This model is enabled through the use of a carpetbag cache which travels with the thread and provides data from the node previously visited. In this way, the latency of traveling to a node already visited is avoided by paying the additional bandwidth and packaging costs associated with moving the cache. Furthermore, it is shown that thrashing between two nodes will generally occur without the use of the cache. Each of the simulations conducted in this work was conducted under stress. By using the Data Intensive Systems (DIS) benchmark suite, the model was subjected to "worst case" memory access patterns, and ultimately still proved highly successful. Additionally, it is shown that this model caters to the positive aspects of a PIM - specifically, the very fast local memory access available.

Keywords

cache storage; distributed programming; memory architecture; parallel architectures; performance evaluation; random-access storage; DIS benchmark suite; DRAM; Data Intensive Systems; carpetbag cache; continuation management; fast local memory access; latency; massively parallel PIM array; mobile thread execution model; petaFLOP performance; processing-in-memory; thrashing; worst case memory access patterns; Bandwidth; Delay; Laboratories; Logic; Memory management; Packaging; Propulsion; Random access memory; Read-write memory; Yarn;

fLanguage

English

Publisher

ieee

Conference_Titel

Innovative Architecture for Future Generation High-Performance Processors and Systems, 2002. International Workshop on

ISSN

1537-3223

Print_ISBN

0-7695-1635-1

Type

conf

DOI

10.1109/IWIA.2002.1035017

Filename

1035017