DocumentCode :
625676
Title :
A Scalable Heterogeneous Parallelization Framework for Iterative Local Searches
Author :
Burtscher, Martin ; Rabeti, Hassan
Author_Institution :
Dept. of Comput. Sci., Texas State Univ. - San Marcos, San Marcos, TX, USA
fYear :
2013
fDate :
20-24 May 2013
Firstpage :
1289
Lastpage :
1298
Abstract :
This paper describes and evaluates a highly-scalable framework for running iterative local searches on heterogeneous HPC platforms. The user only needs to provide serial CPU or single-GPU code that implements a simple interface. The framework then executes this code in parallel using MPI between compute nodes and OpenMP and multi-GPU support within nodes. It handles all parallelization aspects, seed distribution and program termination, and it regularly records the currently best solution. We evaluate our framework on three supercomputers using a heuristic iterative hill-climbing TSP solver as well as a search for good finite-state machines. The framework scales to 2048 nodes (32,768 cores) on Ranger with less than a 5% drop in efficiency, searches over 12.2 trillion TSP tours per second on Stampede using 1024 nodes, and evaluates over 21.5 trillion FSM transitions per second using 256 CPUs and 384 GPUs on Keeneland.
Keywords :
finite state machines; graphics processing units; iterative methods; mathematics computing; parallel processing; search problems; travelling salesman problems; FSM; Keeneland; MPI; OpenMP; Ranger; Stampede; finite-state machine; heterogeneous HPC platform; heuristic iterative hill-climbing TSP solver; iterative local search; multiGPU support; program termination; scalable heterogeneous parallelization framework; seed distribution; serial CPU; single-GPU code; supercomputer; Central Processing Unit; Cities and towns; Data structures; Graphics processing units; Instruction sets; Optimization; Supercomputers; heterogeneous CPU/GPU computing; iterative local champion search; parallelization framework;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on
Conference_Location :
Boston, MA
ISSN :
1530-2075
Print_ISBN :
978-1-4673-6066-1
Type :
conf
DOI :
10.1109/IPDPS.2013.27
Filename :
6569904
Link To Document :
بازگشت