Title :
Performance model for OpenMP parallelized loops
Author :
Zheng, Zhong ; Chen, Xuhao ; Wang, Zhiying ; Shen, Li ; Li, Jiawen
Author_Institution :
Sch. of Comput., Nat. Univ. of Defense Technol., Changsha, China
Abstract :
OpenMP is one of the most widely used parallel programming techniques in the modern multi-core era. Parallelizing a loop with OpenMP is as simple as adding a few directives. However, because it is so simple, programmers often overuse OpenMP to parallelize loops in various applications, which introduces excessive overhead and degrades performance. This paper establishes a performance model for OpenMP-parallelized loops that captures the critical factors influencing performance. The model is validated through experiments on three different multi-core platforms. The results show that the best performance is obtained when the number of threads used in an OpenMP application equals the number of cores available on the platform, and that parallelizing the outermost loop of a nested loop yields a higher speedup.
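The two recommendations in the abstract can be illustrated with a minimal C sketch. It is not taken from the paper: the function names `pick_thread_count` and `matvec_outer_parallel`, the matrix-vector workload, and the static schedule are all assumptions chosen for illustration. The directive is placed on the outermost loop, and the requested thread count is set to the number of processors reported by the OpenMP runtime.

```c
#include <stddef.h>
#ifdef _OPENMP
#include <omp.h>
#endif

/* Hypothetical helper (not from the paper): request one thread per
 * processor reported by the OpenMP runtime, or 1 when the code is
 * compiled without OpenMP support. */
static int pick_thread_count(void) {
#ifdef _OPENMP
    return omp_get_num_procs();
#else
    return 1;
#endif
}

/* Computes y = A * x for a row-major matrix with `rows` x `cols` entries.
 * Only the outermost loop carries the directive, so the threads are
 * started and joined once per call rather than once per row. */
static void matvec_outer_parallel(const double *a, const double *x,
                                  double *y, int rows, int cols) {
    int nthreads = pick_thread_count();
    (void)nthreads; /* silences an unused-variable warning without OpenMP */

    #pragma omp parallel for num_threads(nthreads) schedule(static)
    for (int i = 0; i < rows; ++i) {
        double sum = 0.0; /* private to each iteration, so no data race */
        for (int j = 0; j < cols; ++j) {
            sum += a[(size_t)i * cols + j] * x[j];
        }
        y[i] = sum;
    }
}
```

With GCC or Clang, the pragma takes effect only when the file is compiled with `-fopenmp`; otherwise the loop runs serially and produces the same result. Moving the directive to the inner loop would create and synchronize a thread team once per row, which is the kind of overhead the abstract warns about.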
Keywords :
application program interfaces; parallel programming; program control structures; OpenMP parallelized loops; multicore platforms; nested loops; parallel programming techniques; performance model; Analytical models; Computational modeling; Explosions; Instruction sets; Libraries; Multicore processing; Operating systems; Loop; OpenMP; Performance model;
Conference_Title :
Transportation, Mechanical, and Electrical Engineering (TMEE), 2011 International Conference on
Conference_Location :
Changchun
Print_ISBN :
978-1-4577-1700-0
DOI :
10.1109/TMEE.2011.6199223