• DocumentCode
    1871900
  • Title

    Optimizing the use of GPU memory in applications with large data sets

  • Author

    Satish, Nadathur ; Sundaram, Narayanan ; Keutzer, Kurt

  • Author_Institution
    Univ. of California, Berkeley, CA, USA
  • fYear
    2009
  • fDate
    16-19 Dec. 2009
  • Firstpage
    408
  • Lastpage
    418
  • Abstract
    With general purpose programmable GPUs becoming more and more popular, automated tools are needed to bridge the gap between achievable performance from highly parallel architectures and the performance required in applications. In this paper, we concentrate on improving GPU memory management for applications with large and intermediate data sets that do not completely fit in GPU memory. For such applications, the movement of the extra data to CPU memory must be carefully managed. In particular, we focus on solving the joint task scheduling and data transfer scheduling problem posed in (N. Sundaram et al., May 2009), and propose an algorithm that gives close to optimal results (as measured by running simulated annealing overnight) in terms of the amount of data transferred for image processing benchmarks such as edge detection and convolutional neural networks. Our results enable a reduction of up to 30× in the amount of data transfers compared to an unoptimized implementation. They are up to 2× better than the methods previously proposed in (N. Sundaram et al., May 2009) and less than 16% away from the optimal solution.
  • Keywords
    computer graphics; coprocessors; edge detection; neural nets; scheduling; simulated annealing; storage management; GPU memory management; convolutional neural networks; data transfer; data transfer scheduling problem; edge detection; general purpose programmable GPU; image processing; joint task scheduling; large data sets; Bandwidth; Bridges; Cellular neural networks; Image processing; Memory management; Moore´s Law; Neural networks; Parallel architectures; Particle measurements; Scheduling algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing (HiPC), 2009 International Conference on
  • Conference_Location
    Kochi
  • Print_ISBN
    978-1-4244-4922-4
  • Electronic_ISBN
    978-1-4244-4921-7
  • Type

    conf

  • DOI
    10.1109/HIPC.2009.5433185
  • Filename
    5433185