• DocumentCode
    117310
  • Title

    An investigation of Unified Memory Access performance in CUDA

  • Author

    Landaverde, Raphael ; Tiansheng Zhang ; Coskun, Ayse K. ; Herbordt, Martin

  • Author_Institution
    Electr. & Comput. Eng. Dept., Boston Univ., Boston, MA, USA
  • fYear
    2014
  • fDate
    9-11 Sept. 2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Managing memory between the CPU and GPU is a major challenge in GPU computing. A programming model, Unified Memory Access (UMA), has been recently introduced by Nvidia to simplify the complexities of memory management while claiming good overall performance. In this paper, we investigate this programming model and evaluate its performance and programming model simplifications based on our experimental results. We find that beyond on-demand data transfers to the CPU, the GPU is also able to request subsets of data it requires on demand. This feature allows UMA to outperform full data transfer methods for certain parallel applications and small data sizes. We also find, however, that for the majority of applications and memory access patterns, the performance overheads associated with UMA are significant, while the simplifications to the programming model restrict flexibility for adding future optimizations.
  • Keywords
    electronic data interchange; graphics processing units; parallel architectures; performance evaluation; shared memory systems; storage management; CPU; CUDA; GPU computing; Nvidia; UMA; data transfer method; memory access pattern; memory management; on-demand data transfer; performance evaluation; programming model simplification; unified memory access performance; Acceleration; Benchmark testing; Data transfer; Graphics processing units; Kernel; Programming; Runtime;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Extreme Computing Conference (HPEC), 2014 IEEE
  • Conference_Location
    Waltham, MA
  • Print_ISBN
    978-1-4799-6232-7
  • Type

    conf

  • DOI
    10.1109/HPEC.2014.7040988
  • Filename
    7040988