DocumentCode :
656210
Title :
Tiled QR Decomposition and Its Optimization on CPU and GPU Computing System
Author :
Dongjin Kim ; Kyu-Ho Park
Author_Institution :
Electr. Eng., Korea Adv. Inst. of Sci. & Technol., Daejeon, South Korea
fYear :
2013
fDate :
1-4 Oct. 2013
Firstpage :
744
Lastpage :
753
Abstract :
Heterogeneous computing systems take many forms, and the most useful is the combined CPU and GPU computing system. On such a system, we run QR decomposition, which expresses a standard real matrix as a product of two matrices. For the tiled QR decomposition algorithm, a parallelized version of QR decomposition, the main issue is how to distribute the tiles among the devices, because the computing devices are heterogeneous and communication between them is costly. The goal of this study is to optimize the tile distribution and the tiled QR decomposition operation mathematically for a given system. We select the main computing device for the main steps of the algorithm, optimize the number of devices, and optimize the tile distribution among the devices using a distribution guide array. Our evaluation confirms that our method has good scalability and that the optimization process maximizes tiled QR decomposition performance.
Keywords :
graphics processing units; matrix algebra; optimisation; CPU computing system; GPU computing system; computing devices; distribution guide array; optimization process; standard real matrix; tiled QR decomposition; Equations; Graphics processing units; Mathematical model; Matrix decomposition; Parallel processing; Performance evaluation; Tiles; GPU; Hybrid; QR decomposition;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Parallel Processing (ICPP), 2013 42nd International Conference on
Conference_Location :
Lyon
ISSN :
0190-3918
Type :
conf
DOI :
10.1109/ICPP.2013.88
Filename :
6687413