• DocumentCode
    3664307
  • Title

    Improved Internode Communication for Tile QR Decomposition for Multicore Cluster Systems

  • Author

    Tomohiro Suzuki

  • Author_Institution
    Dept. of Interdiscipl. Res., Univ. of Yamanashi, Kofu, Japan
  • fYear
    2015
  • fDate
    5/1/2015 12:00:00 AM
  • Firstpage
    1214
  • Lastpage
    1220
  • Abstract
    Tile algorithms for matrix decomposition can generate many fine-grained tasks. Therefore, their suitability for processing with multicourse architecture has attracted much attention from the high-performance computing (HPC) community. Our implementation of tile QR decomposition for a cluster system has dynamic scheduling, OpenMP work- sharing, and other useful features. In this article, we discuss the problems in internodes communications that were present in our previous implementation. The improved implementation has both strong and weak scalability.
  • Keywords
    "Matrix decomposition","Kernel","Dynamic scheduling","Instruction sets","Heuristic algorithms","Clustering algorithms","Multicore processing"
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshop (IPDPSW), 2015 IEEE International
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2015.145
  • Filename
    7284451