Title :
Data Transfer Matters for GPU Computing
Author :
Fujii, Yuka ; Azumi, Takuya ; Nishio, Nobuhiko ; Kato, Shigeo ; Edahiro, Masato
Author_Institution :
Grad. Sch. of Inf. Sci. & Eng., Ritsumeikan Univ., Kusatsu, Japan
Abstract :
Graphics processing units (GPUs) embrace many-core compute devices where massively parallel compute threads are offloaded from CPUs. This heterogeneous nature of GPU computing raises non-trivial data transfer problems especially against latency-critical real-time systems. However even the basic characteristics of data transfers associated with GPU computing are not well studied in the literature. In this paper, we investigate and characterize currently-achievable data transfer methods of cutting-edge GPU technology. We implement these methods using open-source software to compare their performance and latency for real-world systems. Our experimental results show that the hardware-assisted direct memory access (DMA) and the I/O read-and-write access methods are usually the most effective, while on-chip micro controllers inside the GPU are useful in terms of reducing the data transfer latency for concurrent multiple data streams. We also disclose that CPU priorities can protect the performance of GPU data transfers.
Keywords :
file organisation; graphics processing units; input-output programs; microcontrollers; multiprocessing systems; parallel processing; public domain software; real-time systems; DMA; GPU computing; I/O read-and-write access methods; data transfer latency; data transfer methods; graphics processing units; hardware-assisted direct memory access; many-core compute devices; on-chip microcontrollers; open-source software; parallel compute threads; real-world systems; Data transfer; Engines; Graphics processing units; Hardware; Microcontrollers; Performance evaluation; Real-time systems; Data Transfer; GPGPU; Latency; OS; Performance;
Conference_Titel :
Parallel and Distributed Systems (ICPADS), 2013 International Conference on
Conference_Location :
Seoul
DOI :
10.1109/ICPADS.2013.47