• DocumentCode
    2727192
  • Title

    Parallel tree building on a range of shared address space multiprocessors: algorithms and application performance

  • Author

    Shan, Hongzhang ; Singh, Jaswinder Pal

  • Author_Institution
    Dept. of Comput. Sci., Princeton Univ., NJ, USA
  • fYear
    1998
  • fDate
    30 Mar-3 Apr 1998
  • Firstpage
    475
  • Lastpage
    484
  • Abstract
    Irregular, particle-based applications that use trees, for example hierarchical N-body applications, are important consumers of multiprocessor cycles, and are argued to benefit greatly in programming ease from a coherent shared address space programming model. As more and more supercomputing platforms that can support different programming models become available to users, from tightly-coupled hardware-coherent machines to clusters of workstations or SMPs, to truly deliver on its ease of programming advantages to application users it is important that the shared address space model not only perform and scale well in the tightly-coupled case but also port well in performance across the range of platforms (as the message passing model can). For tree-based N-body applications, this is currently not true: while the actual computation of interactions ports well, the parallel tree building phase can become a severe bottleneck on coherent shared address space platforms, in particular on platforms with less aggressive, commodity-oriented communication architectures (even though it takes less 3 percent of the time in most sequential executions). The authors therefore investigate the performance of five parallel tree building methods in the context of a complete galaxy simulation on four very different platforms that support this programming model
  • Keywords
    parallel algorithms; parallel architectures; parallel machines; parallel programming; shared memory systems; software performance evaluation; trees (mathematics); algorithms; application performance; coherent shared address space platforms; coherent shared address space programming model; commodity-oriented communication architectures; complete galaxy simulation; hierarchical N-body applications; interaction computation; irregular particle-based applications; multiprocessor cycles; parallel tree building; programming ease; shared address space multiprocessors; supercomputing platforms; tightly-coupled hardware-coherent machines; trees; workstation clusters; Application software; Buildings; Coherence; Context modeling; Dynamic programming; Hardware; Message passing; Partitioning algorithms; Protocols; Workstations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing Symposium, 1998. IPPS/SPDP 1998. Proceedings of the First Merged International ... and Symposium on Parallel and Distributed Processing 1998
  • Conference_Location
    Orlando, FL
  • ISSN
    1063-7133
  • Print_ISBN
    0-8186-8404-6
  • Type

    conf

  • DOI
    10.1109/IPPS.1998.669959
  • Filename
    669959