DocumentCode :
2806516
Title :
Patterns of Inefficient Performance Behavior in GPU Applications
Author :
Eschweiler, Dominic ; Becker, Daniel ; Wolf, Felix
Author_Institution :
Julich Supercomput. Centre, Forschungszentrum Julich, Jülich, Germany
fYear :
2011
fDate :
9-11 Feb. 2011
Firstpage :
262
Lastpage :
266
Abstract :
Writing efficient software for heterogeneous architectures equipped with modern accelerator devices presents a serious challenge to programmer productivity, creating a need for powerful performance-analysis tools to adequately support the software development process. To guide the design of such tools, we describe typical patterns of inefficient runtime behavior that may adversely affect the performance of applications that use general-purpose processors along with GPU devices through a CUDA compute engine. To evaluate the general impact of these patterns on application performance, we further present a micro benchmark suite that allows the performance penalty of each pattern to be quantified with results obtained on NVIDIA Fermi and Tesla architectures, indeed demonstrating significant delays. Furthermore this suite can be used as a default test scenario to add CUDA support to performance-analysis tools used in high-performance computing.
Keywords :
computer graphic equipment; coprocessors; parallel architectures; performance evaluation; CUDA compute engine; GPU applications; NVIDIA Fermi architecture; Tesla architecture; compute unified device architecture; general-purpose processors; graphics processing unit; high performance computing; inefficient performance behavior pattern; performance analysis tools; software development process; Computer architecture; Graphics processing unit; Instruction sets; Kernel; Performance evaluation; Registers; Runtime; analysis; cuda; patterns; performance; tools;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel, Distributed and Network-Based Processing (PDP), 2011 19th Euromicro International Conference on
Conference_Location :
Ayia Napa
ISSN :
1066-6192
Print_ISBN :
978-1-4244-9682-2
Type :
conf
DOI :
10.1109/PDP.2011.84
Filename :
5738996
Link To Document :
بازگشت