DocumentCode :
1577114
Title :
Integrating Memory Compression and Decompression with Coherence Protocols in Distributed Shared Memory Multiprocessors
Author :
Vittanala, Lakshmana Rao ; Chaudhuri, Mainak
Author_Institution :
Intel Technol. India Pvt. Ltd., Bangalore
fYear :
2007
Firstpage :
4
Lastpage :
4
Abstract :
Ever-increasing memory footprint of applications and increasing mainstream popularity of shared memory parallel computing motivate us to explore memory compression potential in distributed shared memory (DSM) multiprocessors. This paper for the first time integrates on-the-fly cache block compression/decompression algorithms in the cache coherence protocols by leveraging the directory structure already present in these scalable machines. Our proposal is unique in the sense that instead of employing custom compression/decompression hardware, we use a simple on-die protocol processing core in dual-core nodes for running our directory-based coherence protocol suitably extended with compression/decompression algorithms. We design a low-overhead compression scheme based on frequent patterns and zero runs present in the evicted dirty L2 cache blocks. Our compression algorithm examines the first eight bytes of an evicted dirty L2 block arriving at the home memory controller and speculates which compression scheme to invoke for the rest of the block. Our customized algorithm for handling completely zero cache blocks helps hide a significant amount of memory access latency. Our simulation-based experiments on a 16-node DSM multiprocessor with seven scientific computing applications show that our best design achieves, on average, 16% to 73% storage saving per evicted dirty L2 cache block for four out of the seven applications at the expense of at most 15% increased parallel execution time.
Keywords :
cache storage; data compression; distributed shared memory systems; memory protocols; cache coherence protocol; distributed shared memory multiprocessor; dual-core node; evicted dirty L2 cache block; memory access latency; memory compression; memory controller; memory decompression; on-die protocol processing; parallel computing; Airports; Application software; Compression algorithms; Computational modeling; Computer architecture; Computer science; Hardware; Parallel processing; Protocols; Scientific computing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Processing, 2007. ICPP 2007. International Conference on
Conference_Location :
Xi´an
ISSN :
0190-3918
Print_ISBN :
978-0-7695-2933-2
Type :
conf
DOI :
10.1109/ICPP.2007.51
Filename :
4343811
Link To Document :
بازگشت