DocumentCode
3664307
Title
Improved Internode Communication for Tile QR Decomposition for Multicore Cluster Systems
Author
Tomohiro Suzuki
Author_Institution
Dept. of Interdiscipl. Res., Univ. of Yamanashi, Kofu, Japan
fYear
2015
fDate
5/1/2015 12:00:00 AM
Firstpage
1214
Lastpage
1220
Abstract
Tile algorithms for matrix decomposition can generate many fine-grained tasks. Therefore, their suitability for multicore architectures has attracted much attention from the high-performance computing (HPC) community. Our implementation of tile QR decomposition for a cluster system has dynamic scheduling, OpenMP work-sharing, and other useful features. In this article, we discuss the problems in internode communication that were present in our previous implementation. The improved implementation exhibits both strong and weak scalability.
Keywords
"Matrix decomposition","Kernel","Dynamic scheduling","Instruction sets","Heuristic algorithms","Clustering algorithms","Multicore processing"
Publisher
IEEE
Conference_Titel
Parallel and Distributed Processing Symposium Workshop (IPDPSW), 2015 IEEE International
Type
conf
DOI
10.1109/IPDPSW.2015.145
Filename
7284451