DocumentCode :
1787043
Title :
An efficient STT-RAM last level cache architecture for GPUs
Author :
Samavatian, Mohammad Hossein ; Abbasitabar, Hamed ; Arjomand, Mohammad ; Sarbazi-Azad, H.
Author_Institution :
Comput. Eng. Dept., Sharif Univ. of Technol., Tehran, Iran
fYear :
2014
fDate :
1-5 June 2014
Firstpage :
1
Lastpage :
6
Abstract :
In this paper, having investigated the behavior of GPGPU applications, we present an efficient L2 cache architecture for GPUs based on STT-RAM technology. With the increase of processing cores count, larger on-chip memories are required. Due to its high density and low power characteristics, STT-RAM technology can be utilized in GPUs where numerous cores leave a limited area for on-chip memory banks. They have however two important issues, high energy and latency of write operations, that have to be addressed. Low data retention time STT-RAMs can reduce the energy and delay of write operations. However, employing STT-RAMs with low retention time in GPUs requires a thorough investigation on the behavior of GPGPU applications based on which the STT-RAM based L2 cache is architectured. The STT-RAM L2 cache architecture proposed in this paper, can improve IPC by more than 100% (16% on average) while reducing the average consumed power by 20% compared to a conventional L2 cache architecture with equal on-chip area.
Keywords :
cache storage; graphics processing units; random-access storage; GPGPU applications; IPC; STT-RAM L2 cache architecture; STT-RAM last level cache architecture; STT-RAM technology; low data retention time STT-RAM; on-chip area; on-chip memory banks; processing cores count; write operations; Arrays; Graphics processing units; Instruction sets; Microprocessors; Random access memory; Registers; GPGPU Application; GPU; Retention Time; STT-RAM;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Design Automation Conference (DAC), 2014 51st ACM/EDAC/IEEE
Conference_Location :
San Francisco, CA
Type :
conf
Filename :
6881524
Link To Document :
بازگشت