• DocumentCode
    3041415
  • Title
    Performance comparison of pure MPI vs hybrid MPI-OpenMP parallelization models on SMP clusters
  • Author
    Drosinos, Nikolaos ; Koziris, Nectarios
  • Author_Institution
    Sch. of Electr. & Comput. Eng., Nat. Tech. Univ. of Athens, Greece
  • fYear
    2004
  • fDate
    26-30 April 2004
  • Firstpage
    15
  • Abstract
    Summary form only given. We compare the performance of three programming paradigms for parallelizing nested loop algorithms on SMP clusters. More specifically, we propose three alternative models for tiled nested loop algorithms: a pure message passing paradigm and two hybrid ones that implement communication through both message passing and shared memory access. The hybrid models adopt an advanced hyperplane scheduling scheme that allows for both minimal thread synchronization and pipelined execution with overlapping computation and communication phases. We focus on the experimental evaluation of all three models and test their performance against several iteration spaces and parallelization grains with the aid of a typical microkernel benchmark. We conclude that the hybrid models can in some cases be more beneficial than the monolithic pure message passing model, as they better exploit the configuration characteristics of a hierarchical parallel platform such as an SMP cluster.
  • Keywords
    distributed shared memory systems; message passing; open systems; pipeline processing; synchronisation; workstation clusters; MPI; OpenMP parallelization; SMP clusters; hybrid model; hyperplane scheduling scheme; message passing paradigm; microkernel benchmark; nested loop algorithm; pipelined execution; shared memory access; thread synchronization; Clustering algorithms; Concurrent computing; Electronic mail; Laboratories; Message passing; Parallel programming; Processor scheduling; Systems engineering and theory; Testing; Yarn;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
  • Print_ISBN
    0-7695-2132-0
  • Type
    conf
  • DOI
    10.1109/IPDPS.2004.1302919
  • Filename
    1302919