• DocumentCode
    2133048
  • Title

    Effects of multithreading on data and workload distribution for distributed-memory multiprocessors

  • Author

    Sohn, Andrew ; Sato, Mitsuhisa ; Yoo, Namhoon ; Gaudiot, Jean-Luc

  • Author_Institution
    Dept. of Comput. & Inf. Sci., New Jersey Inst. of Technol., Newark, NJ, USA
  • fYear
    1996
  • fDate
    15-19 Apr 1996
  • Firstpage
    116
  • Lastpage
    122
  • Abstract
    While data and workload distribution can be tailored to fit a particular problem to a particular distributed-memory architecture, it is often difficult to do so for various practical issues. This paper presents our study on multithreading for distributed-memory multiprocessors. Specifically, we investigate the effects of multithreading on data distribution and workload distribution with variable thread granularity. Various types of workload distribution strategies are defined along thread granularity. Three types of data distribution strategies are investigated: row-wise cyclic, k-way partial-row cyclic and blocked distribution. We have implemented all of these on the 80-processor EM-4 distributed-memory multiprocessor using highly-sequential Gaussian elimination with partial pivoting and highly-parallel matrix multiplication. Experimental results indicated that multithreading can offset the loss that is due to the mismatch of data distribution to workload distribution for even sequential and irregular problems while giving high absolute performance
  • Keywords
    distributed memory systems; matrix multiplication; memory architecture; parallel architectures; EM-4 distributed-memory multiprocessor; Gaussian elimination; absolute performance; blocked distribution; data distribution; distributed-memory architecture; irregular problems; k-way partial-row cyclic distribution; multithreading; parallel matrix multiplication; partial pivoting; row-wise cyclic distribution; sequential problems; variable thread granularity; workload distribution; Buildings; Computer architecture; Costs; Delay; Distribution strategy; Multithreading; Parallel machines; Performance loss; Runtime; Yarn;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing Symposium, 1996., Proceedings of IPPS '96, The 10th International
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    0-8186-7255-2
  • Type

    conf

  • DOI
    10.1109/IPPS.1996.508047
  • Filename
    508047