DocumentCode
1925232
Title
A compiler for high performance computing with many-core accelerators
Author
Nakasato, Naohito ; Makino, Jun
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. of Aizu, Aizu-Wakamatsu, Japan
fYear
2009
fDate
Aug. 31 2009-Sept. 4 2009
Firstpage
1
Lastpage
9
Abstract
We introduce a newly developed compiler for high performance computing using many-core accelerators. A high peak performance of such accelerators attracts researchers who are always demanding faster computers. However, it is difficult to create an efficient implementation of an existing serial program for such accelerators even in the case of massively parallel problems. While existing parallel programming tools force us to program every details of an implementation from loop-level parallelism to 4-vector SIMD operations, our novel approach is that given a compute intensive problem expressed as a nested loop, the compiler only ask us to define a compute kernel inside the inner-most loop. We observe that input variables appeared in the kernel is classified into two types; invariant during the loop and variables updated in each iteration. The compiler let us to specify either type of the inputs so as it pick a predefined optimal way to process them. The compiler successfully generates the fastest code ever for many-particle simulations with the performance of 500 GFLOPS (single precision) on RV770 GPU. Another successful application is the evaluation of a multidimensional integral. It runs at a speed of 5 - 7 GFLOPS (quadruple precision) on both GRAPE-DR and GPU.
Keywords
parallel programming; program compilers; program control structures; vector processor systems; 4-vector SIMD operation; GPU; GRAPE-DR; compiler; high performance computing; loop-level parallelism; many-core accelerator; many-particle simulation; multidimensional integral; nested loop; parallel programming; serial program; Acceleration; Concurrent computing; High performance computing; Input variables; Kernel; Multidimensional systems; Parallel processing; Parallel programming; Pipelines; Program processors;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing and Workshops, 2009. CLUSTER '09. IEEE International Conference on
Conference_Location
New Orleans, LA
ISSN
1552-5244
Print_ISBN
978-1-4244-5011-4
Electronic_ISBN
1552-5244
Type
conf
DOI
10.1109/CLUSTR.2009.5289127
Filename
5289127
Link To Document