Title :
Performance Comparison with OpenMP Parallelization for Multi-core Systems
Author :
Yang, Chao-Tung ; Chang, Tzu-Chieh ; Wang, Hsien-Yi ; Chu, William C C ; Chang, Chih-Hung
Author_Institution :
Dept. of Comput. Sci., Tunghai Univ., Taichung, Taiwan
Abstract :
Today, the multi-core processor has occupied more and more market shares, and the programming personnel also must face the collision brought by the revolution of multi-core processor. Semiconductor scaling limits and associated power and thermal challenges limit performance growth for single-core microprocessors. This reason leads many microprocessor vendors to turn instead to multi-core chip organizations. So programmer or compiler explicitly parallelize the software is the key for enhance the performance on multi-core chip. At the same time, parallel processing is not only the opportunity but also a challenge. The programmer or compiler explicitly parallelize the software is the key for enhance the performance on multi-core chip. In this paper, what we want to know is there any effective way that can reduce our time on rewrite or can automatically parallel the program for multi-processing purpose and do speedup the processing. We discussed some tools that can automatically generate OpenMP directives from serial C/C++ codes, and compare them with each other include normal C/C++ code, and run on general computer and embedded system. Also we compared some tools that are specifically designed to extract the most of data parallelism from C and FORTRAN kernels and translate them into NVIDIA CUDA or OpenCL to know how mush fast after use them.
Keywords :
C++ language; application program interfaces; multiprocessing systems; parallel processing; program compilers; C/C++ codes; FORTRAN kernel; NVIDIA CUDA; OpenCL; OpenMP parallelization; multicore system; parallel processing; Central Processing Unit; Graphics processing unit; Jacobian matrices; Multicore processing; Parallel processing; Transforms; Auto-parallelization; CUDA; Multi-core; OpenMP; Parallel;
Conference_Titel :
Parallel and Distributed Processing with Applications (ISPA), 2011 IEEE 9th International Symposium on
Conference_Location :
Busan
Print_ISBN :
978-1-4577-0391-1
Electronic_ISBN :
978-0-7695-4428-1
DOI :
10.1109/ISPA.2011.60