• DocumentCode
    2747395
  • Title

    Exploiting global structure for performance on clusters

  • Author

    Donaldson, Stephen R. ; Hill, Jonathan M D ; Skillicom, D.B.

  • Author_Institution
    Comput. Lab., Oxford Univ., UK
  • fYear
    1999
  • fDate
    12-16 Apr 1999
  • Firstpage
    176
  • Lastpage
    182
  • Abstract
    Most parallel programming models for distributed-memory architectures are based on individual threads interacting via send and receive operations. We show that a more structured model, BSP, gains substantial performance improvements by exploiting the extra information implicit in its structure. In particular each thread learns something about global state whenever it receives a message. This information can be used to modify its own behavior to improve collective use of the communication system. The programming model´s semantics also provides implicit knowledge that can be exploited to increase performance. We show that these effects are useful at the application level by comparing the performance of BSP and MPI implementations of the NAS parallel benchmarks
  • Keywords
    distributed memory systems; parallel programming; performance evaluation; workstation clusters; BSP; NAS parallel benchmarks; communication system; distributed-memory architectures; global structure; implicit knowledge; parallel programming models; performance improvements; structured model; Buildings; Concurrent computing; Electrical capacitance tomography; Information science; Laboratories; Libraries; Parallel processing; Programming profession; Skeleton; Yarn;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing, 1999. 13th International and 10th Symposium on Parallel and Distributed Processing, 1999. 1999 IPPS/SPDP. Proceedings
  • Conference_Location
    San Juan
  • Print_ISBN
    0-7695-0143-5
  • Type

    conf

  • DOI
    10.1109/IPPS.1999.760455
  • Filename
    760455