Title :
High Performance Matrix Inversion on a Multi-core Platform with Several GPUs
Author :
Ezzatti, Pablo ; Quintana-Ortí, Enrique S. ; Remón, Alfredo
Author_Institution :
Centro de Calculo, Univ. de la Republica, Montevideo, Uruguay
Abstract :
Inversion of large-scale matrices appears in a few scientific applications like model reduction or optimal control. Matrix inversion requires an important computational effort and, therefore, the application of high performance computing techniques and architectures for matrices with dimension in the order of thousands. Following the recent uprise of graphics processors (GPUs), we present and evaluate high performance codes for matrix inversion, based on Gauss-Jordan elimination with partial pivoting, which off-load the main computational kernels to one or more GPUs while performing fine-grain operations on the general-purpose processor. The target architecture consists of a multi-core processor connected to several GPUs. Parallelism is extracted from parallel implementations of BLAS and from the concurrent execution of operations in the available computational units. Numerical experiments on a system with two Intel QuadCore processors and four NVIDIA cl060 GPUs illustrate the efficiency and the scalability of the different implementations, which deliver over 1.2 x 1012 floating point operations per second.
Keywords :
computer graphic equipment; coprocessors; matrix algebra; multiprocessing systems; GPU; Gauss-Jordan elimination; Intel QuadCore processors; floating point operations; graphics processors; matrix inversion; multicore platform; multicore processor; optimal control; parallel implementations; Electronic mail; Graphics processing unit; Multicore processing; Parallel processing; Partitioning algorithms; GPUs; linear algebra; matrix inversion;
Conference_Titel :
Parallel, Distributed and Network-Based Processing (PDP), 2011 19th Euromicro International Conference on
Conference_Location :
Ayia Napa
Print_ISBN :
978-1-4244-9682-2
DOI :
10.1109/PDP.2011.66