  • DocumentCode
    3543566
  • Title
    Optimization Techniques and Performance Analyses of Two Life Science Algorithms for Novel GPU Architectures
  • Author
    Dilch, David ; Mehofer, Eduard
  • Author_Institution
    Res. Group Sci. Comput., Univ. of Vienna, Vienna, Austria
  • fYear
    2012
  • fDate
    15-17 Feb. 2012
  • Firstpage
    376
  • Lastpage
    383
  • Abstract
    In this paper we evaluate two life science algorithms, namely Needleman-Wunsch sequence alignment and Direct Coulomb Summation, for GPUs. Whereas for Needleman-Wunsch it is difficult to get good performance numbers, Direct Coulomb Summation is particularly suitable for graphics cards. We present several optimization techniques, analyze the theoretical potential of the optimizations with respect to the algorithms, and measure the effect on execution times. We target the recent NVIDIA Fermi architecture to evaluate the performance impacts of novel hardware features like the cache subsystem on optimizing transformations. We compare the execution times of CUDA and OpenCL code versions for Fermi and predecessor models with parallel OpenMP versions executed on the main CPU.
  • Keywords
    cache storage; graphics processing units; optimisation; parallel architectures; performance evaluation; CUDA; GPU architectures; NVIDIA Fermi architecture; Needleman-Wunsch sequence alignment; OpenCL code versions; cache subsystem; direct Coulomb summation; graphics cards; life science algorithms; main CPU; optimization techniques; parallel OpenMP versions; performance analyses; Computer architecture; Graphics processing unit; Hardware; Instruction sets; Kernel; Optimization; Synchronization; CUDA; Fermi; OpenCL; life science algorithms; optimizations; performance evaluation;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel, Distributed and Network-Based Processing (PDP), 2012 20th Euromicro International Conference on
  • Conference_Location
    Garching
  • ISSN
    1066-6192
  • Print_ISBN
    978-1-4673-0226-5
  • Type
    conf
  • DOI
    10.1109/PDP.2012.87
  • Filename
    6169575