Abstract:
Big data processing, e.g., graph computation and MapReduce, is characterized by massive parallelism in computation and a large number of fine-grained random memory accesses, often with structural locality due to graph-like data dependencies. Recently, GPUs have been gaining attention for servers due to their parallel computation capability. However, the current GPU architecture is not well suited to big data workloads because of its limited capability to handle a large number of memory requests. In this paper, we present a special function unit, called the memory fast-forward (MFF) unit, to address this problem. The proposed MFF unit provides two key functions. First, it supports pointer chasing, which enables computation threads to issue as many memory requests as possible, increasing the potential for coalescing memory requests. Second, it coalesces memory requests bound for the same cache block, which often occur due to structural locality, thereby reducing memory traffic. Both pointer chasing and memory request coalescing reduce memory stall time and improve the effective utilization of memory bandwidth by removing duplicate memory traffic, thereby improving performance and energy efficiency. Our experiments with graph computation algorithms and real graphs show that the proposed MFF unit improves the energy efficiency of the GPU in graph computation by 54.6% on average at a negligible area cost.
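To make the targeted access pattern concrete, the following is a minimal CUDA sketch (not the paper's code) of the fine-grained, data-dependent memory accesses the abstract describes: each thread walks a graph's adjacency structure, so every load depends on the result of a previous load. The kernel and names such as row_ptr, col_idx, and rank are illustrative assumptions, not part of the proposed MFF design.

// One thread per vertex: sum the ranks of its neighbors (a PageRank-style gather
// over a CSR graph). Each col_idx[e] load depends on row_ptr[v], and each
// rank[u] load depends on col_idx[e] -- the dependent, irregular loads that a
// conventional GPU pipeline stalls on, and that an MFF-style unit could chase
// and coalesce when neighbors of nearby vertices share a cache block.
__global__ void gather_neighbors(const int *row_ptr, const int *col_idx,
                                 const float *rank, float *out, int n_vertices) {
    int v = blockIdx.x * blockDim.x + threadIdx.x;
    if (v >= n_vertices) return;
    float sum = 0.0f;
    for (int e = row_ptr[v]; e < row_ptr[v + 1]; ++e) {
        int u = col_idx[e];   // data-dependent index (first "hop")
        sum += rank[u];       // fine-grained random access with structural locality
    }
    out[v] = sum;
}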
Keywords:
Big data; GPU; graphics processing units; cache storage; parallel processing; power-aware computing; memory fast-forward (MFF) unit; pointer chasing; memory request coalescing; cache block; structural locality; graph computation; energy efficiency; memory stall reduction; memory traffic reduction