• DocumentCode
    3142656
  • Title

    Analysis of a Memory Bandwidth Limited Scenario for NUMA and GPU Systems

  • Author

    Breitbart, Jens

  • Author_Institution
    Res. Group Program. Languages / Methodologies, Univ. Kassel, Kassel, Germany
  • fYear
    2011
  • fDate
    16-20 May 2011
  • Firstpage
    693
  • Lastpage
    699
  • Abstract
    The processing power and parallelism in hardware is expected to increase rapidly over the next years, whereas memory bandwidth per flop and the amount of main memory per flop will be falling behind. These trends will result in both more algorithms to become limited by memory bandwidth, and overall memory requirements to become an important factor for algorithm design. In this paper we study the Gauss-Seidel stencil as an example of a memory bandwidth limited algorithm. We consider GPUs and NUMA systems, which are both designed to provide high memory bandwidth at the cost of making algorithm design more complex. The mapping of the non-linear memory access pattern of the Gauss-Seidel stencil to the different hardware is important to achieve high performance. We show that there is a trade-off between overall performance and memory requirements when optimizing for optimal memory access pattern. Vectorizing on the NUMA system and optimizing to utilize all processors on the GPU does not pay off in terms of performance per memory used, which we consider an important measurement regarding the trends named before.
  • Keywords
    computer graphic equipment; coprocessors; file organisation; iterative methods; multiprocessing systems; GPU system; Gauss-Seidel stencil; NUMA system; algorithm design; memory bandwidth limited algorithm; memory requirements; multicore CPU; nonlinear memory access pattern mapping; Bandwidth; Hardware; Instruction sets; Layout; Memory management; Sockets; Tiles;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 IEEE International Symposium on
  • Conference_Location
    Shanghai
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-61284-425-1
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2011.193
  • Filename
    6008838