DocumentCode :
161011
Title :
Performance comparison of CPU and GPU on a discrete heterogeneous architecture
Author :
Thomas, Windsor ; Daruwala, R.D.
Author_Institution :
Dept. of Electr. Eng., Veermata Jijabai Technol. Inst., Mumbai, India
fYear :
2014
fDate :
4-5 April 2014
Firstpage :
271
Lastpage :
276
Abstract :
Today Graphics Processing Units (GPUs) in scientific computing have led the computing system to achieve tera-scale computing power to the laptops and peta-scale computing power to the clusters by combining multicore Central Processing Units (CPUs) and many core GPUs which can be called a heterogeneous computer architecture. This paper describes briefly an evolutionary journey of GPUs. For performance comparison, parameters considered are latency and throughput. So based on the execution time of a GPU and CPU for a given task, written with Compute Unified Device Architecture (CUDA) C language, the two parameters are measured with increasing size of workload. When the task size is increased GPU is found to be approximately 51% faster than the multithreaded CPU when GPU achieves 100% occupancy. Throughput of GPU is found to be 2.1 times higher than that of CPU for large task size. The GPU used is NVIDIA´s GeForce GT630M with CPU of Intel´s i-5 3210M 3rd generation processor.
Keywords :
C language; graphics processing units; multi-threading; multiprocessing systems; parallel architectures; performance evaluation; CPU execution time; CUDA C language; GPU execution time; Intel´s i-5 3210M third generation processor; NVIDIA GeForce GT630M; clusters; compute unified device architecture C language; graphics processing units; heterogeneous computer architecture; laptops; latency; many core GPU; multicore central processing units; performance comparison; peta-scale computing power; scientific computing; tera-scale computing power; throughput; Central Processing Unit; Computer architecture; Graphics processing units; Instruction sets; Kernel; Parallel processing; Throughput; CPU; CUDA; GPU; execution time; mulitocore; speed up; thread; throughput;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuits, Systems, Communication and Information Technology Applications (CSCITA), 2014 International Conference on
Conference_Location :
Mumbai
Type :
conf
DOI :
10.1109/CSCITA.2014.6839271
Filename :
6839271
Link To Document :
بازگشت