DocumentCode :
2996390
Title :
Performance Study of SIMD Programming Models on Intel Multicore Processors
Author :
Kristof, Peter ; Yu, Hongtao ; Li, Zhiyuan ; Tian, Xinmin
Author_Institution :
Dept. of Comput. Sci., Purdue Univ., West Lafayette, IN, USA
fYear :
2012
fDate :
21-25 May 2012
Firstpage :
2423
Lastpage :
2432
Abstract :
Modern multicore hardware employs a variety of parallel execution units, including multiple CPU cores for executing multiple threads simultaneously, vector units such as the Intel SIMD on the CPU cores, as well as GPU-like processing arrays. Availability of such unprecedented level of parallelism on main-stream computers offers an enormous potential to enable a new generation of computation-intensive nontraditional applications. On the other hand, how to best harness the hardware parallelism presents a new challenge to application programmers, language designers and compiler developers. In this paper, we evaluate the impact of several different parallel execution models, especially the new SIMD vectorization methods, supported by the latest Intel ICC compiler (version 12.1), using three computation-intensive nontraditional parallel applications as the test workload. Unlike traditional numerical programs, these applications use highly irregular data structures and therefore present nontrivial challenges to effective use of SIMD vector units. The first application is a game engine architecture requiring real-time performance. The second application involves a kd-tree traversal, which is typical to the state-of-the-art 3D ray-tracing applications. The last application processes data for large-scale weather visualization system in the order of tens of minutes. We compare the execution time of these codes using different SIMD models supported by ICC in conjunction with parallel threading under TBB and OpenMP.
Keywords :
computer games; data visualisation; meteorology; multiprocessing systems; parallel programming; ray tracing; 3D ray-tracing application; GPU-like processing arrays; Intel ICC compiler; Intel SIMD; Intel multicore processors; OpenMP; SIMD programming models; SIMD vectorization method; TBB; application programmers; compiler developers; game engine architecture; hardware parallelism; kd-tree traversal; language designers; large-scale weather visualization system; multicore hardware; multiple CPU cores; multiple threads; parallel execution models; parallel execution units; parallel threading; real-time performance; Benchmark testing; Computational modeling; Engines; Games; Parallel processing; Program processors; Vectors; Intel ICC compiler; SIMD; computer graphics; optimization; parallel programming;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2012 IEEE 26th International
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0974-5
Type :
conf
DOI :
10.1109/IPDPSW.2012.299
Filename :
6270614
Link To Document :
بازگشت