DocumentCode :
228686
Title :
A Computation- and Communication-Optimal Parallel Direct 3-Body Algorithm
Author :
Koanantakool, Penporn ; Yelick, Katherine
Author_Institution :
Comput. Sci. Div., Univ. of California, Berkeley, Berkeley, CA, USA
fYear :
2014
fDate :
16-21 Nov. 2014
Firstpage :
363
Lastpage :
374
Abstract :
Traditional particle simulation methods are used to calculate pairwise potentials, but some problems require 3-body potentials that are calculated over triplets of particles. A direct calculation of 3-body interactions involves O(n^3) interactions, but a nested-loop formulation performs significant redundant computation. In this paper we explore algorithms for 3-body computations that simultaneously optimize three criteria: computation minimization through symmetries, communication optimality, and load balancing. We present a new 3-body algorithm that is both communication and computation optimal. Its optional replication factor, c, saves c^3 in latency (number of messages) and c^2 in bandwidth (volume), with bounded load imbalance. We also consider the k-body case and discuss an algorithm that is optimal if there is a cutoff distance of less than 1/3 of the domain. The 3-body algorithm demonstrates 99% efficiency on tens of thousands of cores, showing strong scaling properties with order-of-magnitude speedups over the naïve algorithm.
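The redundancy the abstract attributes to the nested loop formulation can be illustrated with a small serial count. This is only a sketch of the symmetry argument, not the paper's parallel algorithm: it assumes the 3-body potential is symmetric under any permutation of the three particles, and the function names are hypothetical.

```python
from itertools import combinations
from math import comb


def count_naive_triples(n):
    """Count ordered triples of distinct indices visited by a full triple loop."""
    count = 0
    for i in range(n):
        for j in range(n):
            for k in range(n):
                # Skip degenerate triples that reuse a particle.
                if i != j and j != k and i != k:
                    count += 1
    return count


def count_unique_triples(n):
    """Count unordered triples i < j < k, each visited exactly once."""
    return sum(1 for _ in combinations(range(n), 3))


n = 10
naive = count_naive_triples(n)    # n * (n - 1) * (n - 2) ordered triples
unique = count_unique_triples(n)  # C(n, 3) unordered triples
print(naive, unique, naive // unique)
```

Under the symmetry assumption, each unordered triple is visited 3! = 6 times by the full loop, so restricting to i < j < k removes a factor of 6 in work while the count remains O(n^3).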
Keywords :
parallel algorithms; resource allocation; 3-body computations; 3-body interactions; bounded load imbalance; communication optimality; communication-optimal parallel direct 3-body algorithm; computation minimization; computation-optimal parallel direct 3-body algorithm; k-body case; load balancing; nested loop formulation; optional replication factor; particle simulation methods; Approximation algorithms; Bandwidth; Clustering algorithms; Force; Heuristic algorithms; Program processors; Three-dimensional displays; communication-avoiding algorithms; n-body; parallel algorithms; particle methods;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis, SC14: International Conference for
Conference_Location :
New Orleans, LA
Print_ISBN :
978-1-4799-5499-5
Type :
conf
DOI :
10.1109/SC.2014.35
Filename :
7013017