• DocumentCode
    1960866
  • Title

    Optimization Techniques for Dimensionally Truncated Sparse Grids on Heterogeneous Systems

  • Author

    Deftu, A. ; Murarasu, Alin

  • fYear
    2013
  • fDate
    Feb. 27 2013-March 1 2013
  • Firstpage
    351
  • Lastpage
    358
  • Abstract
    Given the existing heterogeneous processor landscape dominated by CPUs and GPUs, topics such as programming productivity and performance portability have become increasingly important. In this context, an important question refers to how can we develop optimization strategies that cover both CPUs and GPUs. We answer this for fastsg, a library that provides functionality for handling efficiently high-dimensional functions. As it can be employed for compressing and decompressing large-scale simulation data, it finds itself at the core of a computational steering application which serves us as test case. We describe our experience with implementing fastsg\´s time critical routines for Intel CPUs and Nvidia Fermi GPUs. We show the differences and especially the similarities between our optimization strategies for the two architectures. With regard to our test case for which achieving high speedups is a "must\´" for real-time visualization, we report a speedup of up to 6.2x times compared to the state-of-the-art implementation of the sparse grid technique for GPUs.
  • Keywords
    data compression; data visualisation; grid computing; parallel architectures; real-time systems; software libraries; Intel CPU; Nvidia Fermi GPU; computational steering application; dimensionally truncated sparse grids; fastsg library; heterogeneous processor; high-dimensional function handling; large-scale simulation data compression; large-scale simulation data decompression; optimization strategies; optimization techniques; performance portability; programming productivity; real-time visualization; sparse grid technique; time critical routines; Computational modeling; Computer architecture; Graphics processing units; Instruction sets; Optimization; Programming; Vectors; CUDA; GPU; library; optimizations; sparse grids;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel, Distributed and Network-Based Processing (PDP), 2013 21st Euromicro International Conference on
  • Conference_Location
    Belfast
  • ISSN
    1066-6192
  • Print_ISBN
    978-1-4673-5321-2
  • Electronic_ISBN
    1066-6192
  • Type

    conf

  • DOI
    10.1109/PDP.2013.57
  • Filename
    6498575