DocumentCode
2141717
Title
OpenMP/MPI Implementation of Tile QR Factorization on T2K Open Supercomputer
Author
Suzuki, Takumi ; Miyashita, Hiroaki
Author_Institution
Interdiscipl. Grad. Sch. of Med. & Eng., Univ. of Yamanashi, Kofu, Japan
fYear
2013
fDate
26-28 Sept. 2013
Firstpage
141
Lastpage
146
Abstract
We implement the tile QR factorization algorithm on T2K open supercomputer using an OpenMP/MPI programming model. Our implementation does not outperform the ScaLAPACK routine for single-node performance. However, as the number of CPU nodes increases, our implementation clearly outperforms ScaLAPACK and shows good scalability in the form of both strong and weak scaling. With respect to accuracy, our implementation and ScaLAPACK achieve almost the same results. In this report, we provide the technical details of our implementation.
Keywords
digital arithmetic; matrix decomposition; message passing; parallel machines; CPU nodes; MPI implementation; OpenMP implementation; ScaLAPACK routine; T2K open supercomputer; single-node performance; tile QR factorization; Dynamic scheduling; Heuristic algorithms; Kernel; Plasmas; Scalability; Supercomputers; Tiles; MPI; OpenMP; QR factorization; tile algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Embedded Multicore Socs (MCSoC), 2013 IEEE 7th International Symposium on
Conference_Location
Tokyo
Type
conf
DOI
10.1109/MCSoC.2013.20
Filename
6657920
Link To Document