DocumentCode
2536429
Title
Non-intrusive Performance Analysis of Parallel Hardware Accelerated Applications on Hybrid Architectures
Author
Dietrich, Robert ; Ilsche, Thomas ; Juckeland, Guido
Author_Institution
Center for Inf. Services & High Performance Comput. (ZIH), Tech. Univ. Dresden, Dresden, Germany
fYear
2010
fDate
13-16 Sept. 2010
Firstpage
135
Lastpage
143
Abstract
New high performance computing (HPC) applications recently have to face scalability over an increasing number of nodes and the programming of special accelerator hardware. Hybrid composition of large computing systems leads to a new dimension in complexity of software development. This paper presents a novel approach to gain insight into accelerator interaction and utilization without any changes to the application. It leverages well established methods for performance analysis to accelerator hardware, allowing a holistic view on performance bottlenecks of hybrid applications. A general strategy is presented to get dynamic runtime information about hybrid program execution with minimal impact on the program flow. The achievable level of detail is exemplarily studied for the CUDA environment and the OpenCL framework. Combined with existing performance analysis techniques this facilitates obtaining the full potential of hybrid computing power.
Keywords
hybrid simulation; parallel processing; software engineering; CUDA environment; HPC applications; OpenCL framework; high performance computing; hybrid architectures; large computing systems; nonintrusive performance analysis; parallel hardware accelerated applications; software development; Hardware; Instruments; Kernel; Libraries; Monitoring; Runtime; Synchronization; GPGPU; accelerators; event logging; monitoring libraries; performance analysis; tracing;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing Workshops (ICPPW), 2010 39th International Conference on
Conference_Location
San Diego, CA
ISSN
1530-2016
Print_ISBN
978-1-4244-7918-4
Electronic_ISBN
1530-2016
Type
conf
DOI
10.1109/ICPPW.2010.30
Filename
5599208