DocumentCode
2285657
Title
Trading bandwidth for latency: managing continuations through a carpet bag cache
Author
Murphy, Richard C. ; Kogge, Peter M.
Author_Institution
Dept. of Comput. Sci. & Eng., Notre Dame Univ., USA
fYear
2002
fDate
2002
Firstpage
41
Lastpage
49
Abstract
Processing-in-memory (PIM) circumvents the von Neumann bottleneck by combining logic and memory (typically DRAM) on a single die. This work examines the performance of a mobile thread execution model in which threads traverse the system\´s address space in search of their required data over a massively parallel PIM array targeted at petaFLOP performance. This model is enabled through the use of a carpetbag cache which travels with the thread and provides data from the node previously visited. In this way, the latency of traveling to a node already visited is avoided by paying the additional bandwidth and packaging costs associated with moving the cache. Furthermore, it is shown that thrashing between two nodes will generally occur without the use of the cache. Each of the simulations conducted in this work was conducted under stress. By using the Data Intensive Systems (DIS) benchmark suite, the model was subjected to "worst case" memory access patterns, and ultimately still proved highly successful. Additionally, it is shown that this model caters to the positive aspects of a PIM - specifically, the very fast local memory access available.
Keywords
cache storage; distributed programming; memory architecture; parallel architectures; performance evaluation; random-access storage; DIS benchmark suite; DRAM; Data Intensive Systems; carpetbag cache; continuation management; fast local memory access; latency; massively parallel PIM array; mobile thread execution model; petaFLOP performance; processing-in-memory; thrashing; worst case memory access patterns; Bandwidth; Delay; Laboratories; Logic; Memory management; Packaging; Propulsion; Random access memory; Read-write memory; Yarn;
fLanguage
English
Publisher
ieee
Conference_Titel
Innovative Architecture for Future Generation High-Performance Processors and Systems, 2002. International Workshop on
ISSN
1537-3223
Print_ISBN
0-7695-1635-1
Type
conf
DOI
10.1109/IWIA.2002.1035017
Filename
1035017
Link To Document