مرکز منطقه ای اطلاع رساني علوم و فناوري - Improving GPU Performance Prediction with Data Transfer Modeling

DocumentCode :

1997211

Title :

Improving GPU Performance Prediction with Data Transfer Modeling

Author :

Boyer, Megan ; Jiayuan Meng ; Kumaran, Kalyan

Author_Institution :

Dept. of Comput. Sci., Univ. of Virginia, Charlottesville, VA, USA

fYear :

2013

fDate :

20-24 May 2013

Firstpage :

1097

Lastpage :

1106

Abstract :

Accelerators such as graphics processors (GPUs) have become increasingly popular for high performance scientific computing. Often, much effort is invested in creating and optimizing GPU code without any guaranteed performance benefit. To reduce this risk, performance models can be used to project a kernel´s GPU performance potential before it is ported. However, raw GPU execution time is not the only consideration. The overhead of transferring data between the CPU and the GPU is also an important factor; for some applications, this overhead may even erase the performance benefits of GPU acceleration. To address this challenge, we propose a GPU performance modeling framework that predicts both kernel execution time and data transfer time. Our extensions to an existing GPU performance model include a data usage analyzer for a sequence of GPU kernels, to determine the amount of data that needs to be transferred, and a performance model of the PCIe bus, to determine how long the data transfer will take. We have tested our framework using a set of applications running on a production machine at Argonne National Laboratory. On average, our model predicts the data transfer overhead with an error of only 8%, and the inclusion of data transfer time reduces the error in the predicted GPU speedup from 255% to 9%.

Keywords :

graphics processing units; operating system kernels; performance evaluation; peripheral interfaces; Argonne National Laboratory; GPU acceleration; GPU code optimization; GPU kernel sequence; GPU performance modeling framework; GPU performance prediction improvement; GPU speedup prediction; PCIe bus; data transfer modeling; data transfer overhead prediction; data usage analyzer; graphics processing units; graphics processors; high performance scientific computing; performance models; production machine; risk reduction; Bandwidth; Computational modeling; Data models; Data transfer; Graphics processing units; Kernel; Time measurement;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International

Conference_Location :

Cambridge, MA

Print_ISBN :

978-0-7695-4979-8

Type :

conf

DOI :

10.1109/IPDPSW.2013.236

Filename :

6650995

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1997211