• DocumentCode
    656210
  • Title

    Tiled QR Decomposition and Its Optimization on CPU and GPU Computing System

  • Author

    Dongjin Kim ; Kyu-Ho Park

  • Author_Institution
    Electr. Eng., Korea Adv. Inst. of Sci. & Technol., Daejeon, South Korea
  • fYear
    2013
  • fDate
    1-4 Oct. 2013
  • Firstpage
    744
  • Lastpage
    753
  • Abstract
    There can be many types of heterogeneous computing systems, and the most useful one is the CPU and GPU computing system. In this system, we try to run QR decomposition, which expresses a standard real matrix as a production of two matrices. For a tiled QR decomposition algorithm, which is a parallelized version of QR decomposition, because of the heterogeneity of computing devices and communication cost, the way that each tile is distributed into which device is the main issue of tiled QR decomposition. The goal of this study is to optimize the tile distribution and the tiled QR decomposition operation mathematically, depending on the given system. We select the main computing device for the main steps of the algorithm, optimize the number of devices, and optimize the tile distribution among the devices using a distribution guide array. Our evaluation confirms that our method has good scalability and the optimization process maximizes the tiled QR decomposition performance.
  • Keywords
    graphics processing units; matrix algebra; optimisation; CPU computing system; GPU computing system; computing devices; distribution guide array; optimization process; standard real matrix; tiled QR decomposition; Equations; Graphics processing units; Mathematical model; Matrix decomposition; Parallel processing; Performance evaluation; Tiles; GPU; Hybrid; QR decomposition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing (ICPP), 2013 42nd International Conference on
  • Conference_Location
    Lyon
  • ISSN
    0190-3918
  • Type

    conf

  • DOI
    10.1109/ICPP.2013.88
  • Filename
    6687413