• DocumentCode
    3237889
  • Title

    Extending OpenMP For NUMA Machines

  • Author

    Bircsak, John ; Craig, Peter ; Crowell, RaeLyn ; Cvetanovic, Zarka ; Harris, Jonathan ; Nelson, C. Alexander ; Offner, Carl D.

  • Author_Institution
    Compaq Computer Corporation
  • fYear
    2000
  • fDate
    04-10 Nov. 2000
  • Firstpage
    48
  • Lastpage
    48
  • Abstract
    This paper describes extensions to OpenMP that implemen data placemen features needed for NUMA architectures. OpenMP is a collection of compiler directives and library routines used to write portable parallel programs for shared-memory architectures. Writing efficient parallel programs for NUMA architectures, which have characteristics of both shared-memory and distributed-memory architectures, requires that a programmer control the placement of data in memory and the placement of computations that operate on that data. Optimal performance is obtained when computations occur on processors that have fast access to the data needed by those computations. OpenMP-designed for shared-memory architectures-does not by itself address these issues. The extensions to OpenMP Fortran presented here have been mainly taken from High Performance Fortran. The paper describes some of the techniques that the Compaq Fortran compiler uses to generate efficient code based on these extensions. I also describes some additional compiler optimizations, and concludes with some preliminary results.
  • Keywords
    Application software; Bandwidth; Computer architecture; Concurrent computing; Distributed computing; Libraries; Optimizing compilers; Program processors; Programming profession; Yarn;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Supercomputing, ACM/IEEE 2000 Conference
  • ISSN
    1063-9535
  • Print_ISBN
    0-7803-9802-5
  • Type

    conf

  • DOI
    10.1109/SC.2000.10019
  • Filename
    1592761