DocumentCode
1914078
Title
Developing Performance-Portable Molecular Dynamics Kernels in OpenCL
Author
Pennycook, S.J. ; Jarvis, S.A.
Author_Institution
Dept. of Comput. Sci., Univ. of Warwick, Coventry, UK
fYear
2012
fDate
10-16 Nov. 2012
Firstpage
386
Lastpage
395
Abstract
This paper investigates the development of a molecular dynamics code that is highly portable between architectures. Using OpenCL, we develop an implementation of Sandia´s miniMD benchmark that achieves good levels of performance across a wide range of hardware: CPUs, discrete GPUs and integrated GPUs. We demonstrate that the performance bottlenecks of miniMD´s short-range force calculation kernel are the same across these architectures, and detail a number of platform-agnostic optimisations that improve its performance by at least 2x on all hardware considered. Our complete code is shown to be 1.7x faster than the original miniMD, and at most 2x slower than implementations individually hand-tuned for a specific architecture.
Keywords
graphics processing units; parallel processing; CPU; OpenCL; Sandia miniMD benchmark; discrete GPU; high-performance computing; integrated GPU; molecular dynamics code; performance-portable molecular dynamics kernels; platform-agnostic optimisations; short-range force calculation kernel; accelerator architectures; high performance computing; parallel programming; performance analysis; scientific computing;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:
Conference_Location
Salt Lake City, UT
Print_ISBN
978-1-4673-6218-4
Type
conf
DOI
10.1109/SC.Companion.2012.58
Filename
6495840
Link To Document