• DocumentCode
    625676
  • Title

    A Scalable Heterogeneous Parallelization Framework for Iterative Local Searches

  • Author

    Burtscher, Martin ; Rabeti, Hassan

  • Author_Institution
    Dept. of Comput. Sci., Texas State Univ. - San Marcos, San Marcos, TX, USA
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    1289
  • Lastpage
    1298
  • Abstract
    This paper describes and evaluates a highly-scalable framework for running iterative local searches on heterogeneous HPC platforms. The user only needs to provide serial CPU or single-GPU code that implements a simple interface. The framework then executes this code in parallel using MPI between compute nodes and OpenMP and multi-GPU support within nodes. It handles all parallelization aspects, seed distribution and program termination, and it regularly records the currently best solution. We evaluate our framework on three supercomputers using a heuristic iterative hill-climbing TSP solver as well as a search for good finite-state machines. The framework scales to 2048 nodes (32,768 cores) on Ranger with less than a 5% drop in efficiency, searches over 12.2 trillion TSP tours per second on Stampede using 1024 nodes, and evaluates over 21.5 trillion FSM transitions per second using 256 CPUs and 384 GPUs on Keeneland.
  • Keywords
    finite state machines; graphics processing units; iterative methods; mathematics computing; parallel processing; search problems; travelling salesman problems; FSM; Keeneland; MPI; OpenMP; Ranger; Stampede; finite-state machine; heterogeneous HPC platform; heuristic iterative hill-climbing TSP solver; iterative local search; multiGPU support; program termination; scalable heterogeneous parallelization framework; seed distribution; serial CPU; single-GPU code; supercomputer; Central Processing Unit; Cities and towns; Data structures; Graphics processing units; Instruction sets; Optimization; Supercomputers; heterogeneous CPU/GPU computing; iterative local champion search; parallelization framework;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on
  • Conference_Location
    Boston, MA
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4673-6066-1
  • Type

    conf

  • DOI
    10.1109/IPDPS.2013.27
  • Filename
    6569904