Title :
Supporting Low-Latency CPS Using GPUs and Direct I/O Schemes
Author :
Aumiller, Jason ; Brandt, Scott ; Kato, Shinpei ; Rath, Nikolaus
Author_Institution :
Dept. of Comput. Sci., Univ. of California, Santa Cruz, CA, USA
Abstract :
Graphics processing units (GPUs) are increasingly being used for general purpose parallel computing. They provide significant performance gains over multi-core CPU systems, and are an easily accessible alternative to supercomputers. The architecture of general purpose GPU systems(GPGPU), however, poses challenges in efficiently transferring data among the host and device(s). Although commodity many core devices such as NVIDIA GPUs provide more than one way to move data around, it is unclear which method is most effective given a particular application. This presents difficulty in supporting latency-sensitive cyber-physical systems (CPS). In this work we present a new approach to data transfer in a heterogeneous computing system that allows direct communication between GPUs and other I/O devices. In addition to adding this functionality our system also improves communication between the GPU and host. We analyze the current vendor provided data communication mechanisms and identify which methods work best for particular tasks with respect to throughput, and total time to completion. Our method allows a new class of real-time cyber-physical applications to be implemented on a GPGPU system. The results of the experiments presented here show that GPU tasks can be completed in 34 percent less time than current methods. Furthermore, effective data throughput is at least as good as the current best performers. This work is part of concurrent development of Gdev, an open-source project to provide Linux operating system support of many-core device resource management.
Keywords :
Linux; concurrency control; data communication; graphics processing units; multiprocessing systems; parallel processing; public domain software; real-time systems; resource allocation; GPGPU; Linux operating system; concurrent Gdev development; data communication mechanisms; data throughput; data transfer; direct I/O schemes; general purpose GPU systems; general purpose parallel computing; graphics processing units; heterogeneous computing system; latency-sensitive cyber-physical systems; low-latency CPS; many-core device resource management; multicore CPU systems; open source project; real-time cyber-physical applications; Central Processing Unit; Computer architecture; Data communication; Graphics processing unit; Kernel; Random access memory; Throughput; GPGPU; GPU communication; real time systems;
Conference_Titel :
Embedded and Real-Time Computing Systems and Applications (RTCSA), 2012 IEEE 18th International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4673-3017-6
Electronic_ISBN :
1533-2306
DOI :
10.1109/RTCSA.2012.59