• DocumentCode
    2186752
  • Title

    Optimization and Parallelization of Emedge3D on Shared Memory Architecture

  • Author

    Kuhn, Michael ; Latu, Guillaume ; Genaud, Stephane ; Crouseilles, Nicolas

  • Author_Institution
    Icube, Univ. de Strasbourg, Strasbourg, France
  • fYear
    2013
  • fDate
    23-26 Sept. 2013
  • Firstpage
    503
  • Lastpage
    510
  • Abstract
    This paper presents a study of techniques used to speedup a scientific simulation code. The techniques include sequential optimizations as well as the parallelization with OpenMP. This work is carried out on two different multicore shared memory architectures, namely a cutting edge 8×8 core CPU and a more common 2×6 core board. Our target application is representative of many memory bound codes, and the techniques we present show how to overcome the burden of the memory bandwidth limit, which is quickly reached on multi-core or many-core with shared memory architectures. To achieve efficient speedups, strategies are applied to lower the computation costs, and to maximize the use of processors caches. Optimizations are: minimizing memory accesses, simplifying and reordering computations, and tiling loops. On 12 cores processor Intel X5675, aggregation of these optimizations results in an execution time 21.6 faster, compared to the original version on one core.
  • Keywords
    application program interfaces; cache storage; memory architecture; numerical analysis; optimisation; parallel processing; physics computing; plasma simulation; shared memory systems; Emedge3D optimization; Emedge3D parallelization; OpenMP; fluid numerical simulation code; memory access minimization; memory bandwidth limit; memory bound codes; multicore shared memory architecture; processor caches; scientific simulation code; sequential optimization; tiling loops; Bandwidth; Instruction sets; Mathematical model; Multicore processing; Optimization; Three-dimensional displays; Tiles; memory bandwidth bound; optimization; parallelization; scientific computing; shared memory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Symbolic and Numeric Algorithms for Scientific Computing (SYNASC), 2013 15th International Symposium on
  • Conference_Location
    Timisoara
  • Print_ISBN
    978-1-4799-3035-7
  • Type

    conf

  • DOI
    10.1109/SYNASC.2013.72
  • Filename
    6821189