• DocumentCode
    3706505
  • Title

    Open ACC Programs Examined: A Performance Analysis Approach

  • Author

    Robert Dietrich;Guido Juckeland;Michael Wolfe

  • Author_Institution
    Center for Inf. Services &
  • fYear
    2015
  • Firstpage
    310
  • Lastpage
    319
  • Abstract
    The Open ACC standard has been developed to simplify parallel programming of heterogeneous systems. Based on a set of high-level compiler directives it allows application developers to offload code regions from a host CPU to an accelerator without the need for low-level programming with CUDA or Open CL. Details are implicit in the programming model and managed by Open ACC API-enabled compilers and runtimes. However, it is still possible for the application developer to explicitly specify several performance-related details for the execution. To tune an Open ACC program and efficiently utilize available hardware resources, sophisticated performance analysis tools are required. In this paper we present a framework for detailed analysis of Open ACC applications. We describe new analysis capabilities introduced with an Open ACC tools interface and depict the integration of performance analysis for low-level programming models. As proof of concept we implemented the concept into the measurement infrastructure Score-P and the trace browser Vampir. This provides the program developer with a clearer understanding of the dynamic runtime behavior of the application and for systematic identification of potential bottlenecks.
  • Keywords
    "Runtime","Graphics processing units","Programming","Libraries","Kernel","Computational modeling","Instruction sets"
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing (ICPP), 2015 44th International Conference on
  • ISSN
    0190-3918
  • Type

    conf

  • DOI
    10.1109/ICPP.2015.40
  • Filename
    7349586