Title :
Communication-avoiding algorithms for linear algebra and beyond
Author_Institution :
Univ. of California, Berkeley, Berkeley, CA, USA
Abstract :
Algorithms have two costs: arithmetic and communication, i.e. moving data between levels of a memory hierarchy or processors over a network. Communication costs (measured in time or energy per operation) already greatly exceed arithmetic costs, and the gap is growing over time following technological trends. Thus our goal is to design algorithms that minimize communication. We present algorithms that attain provable lower bounds on communication, and show large speedups compared to their conventional counterparts. These algorithms are for direct and iterative linear algebra, for dense and sparse matrices, as well as direct n-body simulations. Several of these algorithms exhibit perfect strong scaling, in both time and energy: run time (resp. energy) for a fixed problem size drops proportionally to p (resp. is independent of p). Finally, we describe extensions to algorithms involving arbitrary loop nests and array accesses, assuming only that array subscripts are linear functions of the loop indices.
Keywords :
algorithm theory; iterative methods; linear algebra; minimisation; sparse matrices; arithmetic cost; array access; array subscripts; communication cost; communication-avoiding algorithms; dense matrices; direct linear algebra; direct n-body simulations; iterative linear algebra; linear algebra; linear functions; loop indices; loop nest; lower bounds; memory hierarchy; perfect strong scaling; sparse matrices; time following technological trends; Abstracts; Algorithm design and analysis; Arrays; Distributed processing; Educational institutions; Linear algebra;
Conference_Titel :
Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4673-6066-1
DOI :
10.1109/IPDPS.2013.123