• DocumentCode
    228686
  • Title

    A Computation- and Communication-Optimal Parallel Direct 3-Body Algorithm

  • Author

    Koanantakool, Penporn ; Yelick, Katherine

  • Author_Institution
    Comput. Sci. Div., Univ. of California, Berkeley, Berkeley, CA, USA
  • fYear
    2014
  • fDate
    16-21 Nov. 2014
  • Firstpage
    363
  • Lastpage
    374
  • Abstract
    Traditional particle simulation methods are used to calculate pair wise potentials, but some problems require 3-body potentials that calculate over triplets of particles. A direct calculation of 3-body interactions involves O(n3) interactions, but has significant redundant computations that occur in a nested loop formulation. In this paper we explore algorithms for 3-body computations that simultaneously optimize three criteria: computation minimization through symmetries, communication optimality, and load balancing. We present a new 3-body algorithm that is both communication and computation optimal. Its optional replication factor, c, saves c3 in latency (number of messages) and c2 in bandwidth (volume), with bounded load imbalance. We also consider the k-body case and discuss an algorithm that is optimal if there is a cut off distance of less than 1/3 of the domain. The 3-body algorithm demonstrates 99% efficiency on tens of thousands of cores, showing strong scaling properties with order of magnitude speedups over the nïve algorithm.
  • Keywords
    parallel algorithms; resource allocation; 3-body computations; 3-body interactions; bounded load imbalance; communication optimality; communication-optimal parallel direct 3-body algorithm; computation minimization; computation-optimal parallel direct 3-body algorithm; k-body case; load balancing; nested loop formulation; optional replication factor; particle simulation methods; Approximation algorithms; Bandwidth; Clustering algorithms; Force; Heuristic algorithms; Program processors; Three-dimensional displays; communication-avoiding algorithms; n-body; parallel algorithms; particle methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis, SC14: International Conference for
  • Conference_Location
    New Orleans, LA
  • Print_ISBN
    978-1-4799-5499-5
  • Type

    conf

  • DOI
    10.1109/SC.2014.35
  • Filename
    7013017