Title :
Improved Internode Communication for Tile QR Decomposition for Multicore Cluster Systems
Author_Institution :
Dept. of Interdiscipl. Res., Univ. of Yamanashi, Kofu, Japan
fDate :
5/1/2015 12:00:00 AM
Abstract :
Tile algorithms for matrix decomposition can generate many fine-grained tasks. Therefore, their suitability for processing with multicourse architecture has attracted much attention from the high-performance computing (HPC) community. Our implementation of tile QR decomposition for a cluster system has dynamic scheduling, OpenMP work- sharing, and other useful features. In this article, we discuss the problems in internodes communications that were present in our previous implementation. The improved implementation has both strong and weak scalability.
Keywords :
"Matrix decomposition","Kernel","Dynamic scheduling","Instruction sets","Heuristic algorithms","Clustering algorithms","Multicore processing"
Conference_Titel :
Parallel and Distributed Processing Symposium Workshop (IPDPSW), 2015 IEEE International
DOI :
10.1109/IPDPSW.2015.145