• DocumentCode
    1553107
  • Title

    Hybrid algorithms for complete exchange in 2D meshes

  • Author

    Sundar, N.S. ; Jayasimha, D.N. ; Panda, Dhabaleswar K. ; Sadayappan, P.

  • Author_Institution
    Hewlett-Packard Co., Cupertino, CA, USA
  • Volume
    12
  • Issue
    12
  • fYear
    2001
  • fDate
    12/1/2001 12:00:00 AM
  • Firstpage
    1201
  • Lastpage
    1218
  • Abstract
    Parallel algorithms for several common problems such as sorting and the FFT involve a personalized exchange of data among all the processors. Past approaches to doing complete exchange have taken one of two broad approaches: direct exchange or the indirect message-combining approaches. While combining approaches reduce the number of message startups, direct exchange minimizes the volume of data transmitted. This paper presents a family of hybrid algorithms for wormhole-routed 2D meshes that can effectively utilize the complementary strengths of these two approaches to complete exchange. The performance of hybrid algorithms using Cyclic Exchange and Scott´s Direct Exchange are studied using analytical models, simulation, and implementation on a Cray T3D system. The results show that hybrids achieve lower completion times than either pure algorithm for a range of mesh sizes, data block sizes, and message startup costs. It is also demonstrated that barriers may be used to enhance performance by reducing message contention, whether or not the target system provides hardware support for barrier synchronization. The analytical models are shown useful in selecting the optimum hybrid for any given combination of system parameters (mesh size, message startup time, flit transfer time, and barrier cost) and the problem parameter (data block size)
  • Keywords
    fast Fourier transforms; multiprocessor interconnection networks; network routing; parallel algorithms; 2D Meshes; Cray T3D system; FFT; data block sizes; hybrid algorithms; message contention; message-combining approaches; parallel algorithms; simulation; sorting; wormhole-routed 2D meshes; Analytical models; Bandwidth; Cost function; Hardware; Helium; Message passing; Multidimensional systems; Routing; Sorting; Topology;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/71.970553
  • Filename
    970553