DocumentCode :
3664307
Title :
Improved Internode Communication for Tile QR Decomposition for Multicore Cluster Systems
Author :
Tomohiro Suzuki
Author_Institution :
Dept. of Interdiscipl. Res., Univ. of Yamanashi, Kofu, Japan
fYear :
2015
fDate :
5/1/2015 12:00:00 AM
Firstpage :
1214
Lastpage :
1220
Abstract :
Tile algorithms for matrix decomposition can generate many fine-grained tasks. Therefore, their suitability for processing with multicourse architecture has attracted much attention from the high-performance computing (HPC) community. Our implementation of tile QR decomposition for a cluster system has dynamic scheduling, OpenMP work- sharing, and other useful features. In this article, we discuss the problems in internodes communications that were present in our previous implementation. The improved implementation has both strong and weak scalability.
Keywords :
"Matrix decomposition","Kernel","Dynamic scheduling","Instruction sets","Heuristic algorithms","Clustering algorithms","Multicore processing"
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium Workshop (IPDPSW), 2015 IEEE International
Type :
conf
DOI :
10.1109/IPDPSW.2015.145
Filename :
7284451
Link To Document :
بازگشت