• DocumentCode
    1827214
  • Title
    DMA-Assisted, Intranode Communication in GPU Accelerated Systems

  • Author
    Ji, Feng ; Aji, Ashwin M. ; Dinan, James ; Buntinas, Darius ; Balaji, Pavan ; Thakur, Rajeev ; Feng, Wu-chun ; Ma, Xiaosong
  • Author_Institution
    Dept. of Comput. Sci., North Carolina State Univ., Raleigh, NC, USA
  • fYear
    2012
  • fDate
    25-27 June 2012
  • Firstpage
    461
  • Lastpage
    468
  • Abstract
    Accelerator awareness has become a pressing issue in data movement models, such as MPI, because of the rapid deployment of systems that utilize accelerators. In our previous work, we developed techniques to enhance MPI with accelerator awareness, thus allowing applications to easily and efficiently communicate data between accelerator memories. In this paper, we extend this work with techniques to perform efficient data movement between accelerators within the same node using a DMA-assisted, peer-to-peer intranode communication technique that was recently introduced for NVIDIA GPUs. We present a detailed design of our new approach to intranode communication and evaluate its improvement to communication and application performance using micro-kernel benchmarks and a 2D stencil application kernel.
  • Keywords
    application program interfaces; file organisation; graphics processing units; message passing; parallel architectures; peer-to-peer computing; 2D stencil application kernel; DMA-assisted peer-to-peer intranode communication technique; GPU accelerated systems; MPI; NVIDIA GPU; accelerator awareness; accelerator memories; data movement models; direct memory access; graphics processing units; message passing interface; microkernel benchmarks; Engines; Graphics processing unit; Peer to peer computing; Performance evaluation; Protocols; Receivers; GPU; Intranode communication; MPI;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    2012 IEEE 14th International Conference on High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems (HPCC-ICESS)
  • Conference_Location
    Liverpool
  • Print_ISBN
    978-1-4673-2164-8
  • Type
    conf
  • DOI
    10.1109/HPCC.2012.69
  • Filename
    6332208