DocumentCode :
244490
Title :
Porting a neuro-imaging application to a CPU-GPU cluster
Author :
Nakhjavani, Reza Sina ; Sharify, Sahel ; Hashemi, Ali B. ; Lu, Alan W. ; Amza, C. ; Strother, Stephen
Author_Institution :
Electr. & Comput. Eng. Dept., Univ. of Toronto, Toronto, ON, Canada
fYear :
2014
fDate :
21-25 July 2014
Firstpage :
137
Lastpage :
145
Abstract :
The ever increasing complexity of scientific applications has led to utilization of new HPC paradigms such as Graphical Processing Units (GPUs). However, modifying existing applications to enable them to be executed on GPU can be challenging. Furthermore, the considerable speedup achieved by execution of linear algebra operations on GPUs has added a huge heterogeneity to HPC clusters. In this work, we enabled NPAIRS, a neuro-imaging application, to be executed on GPU with slight modifications to its original code. This important feature of our implementation enables current users of NPAIRS, i.e. non-expert bio-medical scientists, to get benefit from GPU without having to apply fundamental changes to their existing application. As the second part of our research, we investigated the efficiency of several scheduling algorithms for a heterogeneous cluster that contains GPU nodes. Experimental results show that we achieved 7× speedup for NPAIRS. Moreover, although scheduling does not play an important role when there is no GPU node in the cluster, it can highly improve the makespan for a CPU-GPU cluster. We compared our scheduling results with Torque and MCT, two of the most commonly used schedulers in current HPC platforms. Our results show that the Sufferage scheduling can improve the makespan of Torque and MCT by 47% and 4% respectively.
Keywords :
graphics processing units; medical image processing; neurophysiology; parallel processing; scheduling; CPU-GPU cluster; HPC clusters; MCT; NPAIRS; Sufferage scheduling algorithms; Torque; graphical processing units; heterogeneous cluster; linear algebra operations; neuroimaging application; nonexpert biomedical scientists; Eigenvalues and eigenfunctions; Graphics processing units; Java; Job shop scheduling; Libraries; Principal component analysis; Scheduling algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing & Simulation (HPCS), 2014 International Conference on
Conference_Location :
Bologna
Print_ISBN :
978-1-4799-5312-7
Type :
conf
DOI :
10.1109/HPCSim.2014.6903679
Filename :
6903679
Link To Document :
بازگشت