DocumentCode :
704048
Title :
Memory fast-forward: A low cost special function unit to enhance energy efficiency in GPU for big data processing
Author :
Eunhyeok Park ; Junwhan Ahn ; Sungpack Hong ; Sungjoo Yoo ; Sunggu Lee
fYear :
2015
fDate :
9-13 March 2015
Firstpage :
1341
Lastpage :
1346
Abstract :
Big data processing, e.g., graph computation and MapReduce, is characterized by massive parallelism in computation and a large amount of fine-grained random memory accesses often with structural localities due to graph-like data dependency. Recently, GPU is gaining more and more attention for servers due to its capability of parallel computation. However, the current GPU architecture is not well suited to big data workloads due to the limited capability of handling a large number of memory requests. In this paper, we present a special function unit, called memory fast-forward (MFF) unit, to address this problem. Our proposed MFF unit provides two key functions. First, it supports pointer chasing which enables computation threads to issue as many memory requests as possible to increase the potential of coalescing memory requests. Second, it coalesces memory requests bound for the same cache block, often due to structural locality, thereby reducing memory traffics. Both pointer chasing and memory request coalescing contribute to reducing memory stall time as well as improving the real utilization of memory bandwidth, by removing duplicate memory traffics, thereby improving performance and energy efficiency. Our experiments with graph computation algorithms and real graphs show that the proposed MFF unit can improve the energy efficiency of GPU in graph computation by average 54.6% at a negligible area cost.
Keywords :
Big Data; cache storage; graphics processing units; parallel processing; power aware computing; GPU; MFF unit; big data processing; cache block; coalescing memory requests; computation threads; energy efficiency; graph computation algorithms; memory fast-forward unit; memory stall reduction; memory traffic reduction; parallel computation; pointer chasing; structural locality; Arrays; Big data; Graphics processing units; Instruction sets; Memory management; Registers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Design, Automation & Test in Europe Conference & Exhibition (DATE), 2015
Conference_Location :
Grenoble
Print_ISBN :
978-3-9815-3704-8
Type :
conf
Filename :
7092600
Link To Document :
بازگشت