DocumentCode :
2787319
Title :
Optimizing Data Layouts for Parallel Computation on Multicores
Author :
Zhang, Yuanrui ; Ding, Wei ; Liu, Jun ; Kandemir, Mahmut
Author_Institution :
Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
fYear :
2011
fDate :
10-14 Oct. 2011
Firstpage :
143
Lastpage :
154
Abstract :
The emergence of multicore platforms offers several opportunities for boosting application performance. These opportunities, which include parallelism and data locality benefits, require strong support from compilers as well as operating systems. Current compiler research targeting multicores mostly focuses on code restructuring and mapping. In this work, we explore automatic data layout transformation targeting multithreaded applications running on multicores. Our transformation considers both data access patterns exhibited by different threads of a multithreaded application and the on-chip cache topology of the target multicore architecture. It automatically determines a customized memory layout for each target array to minimize potential cache conflicts across threads. Our experiments show that, our optimization brings significant benefits over state-of-the-art data locality optimization strategies when tested using 30 benchmark programs on an Intel multicore machine. The results also indicate that this strategy is able to scale to larger core counts and it performs better with increased data set sizes.
Keywords :
cache storage; multi-threading; multiprocessing systems; operating systems (computers); optimising compilers; parallel memories; parallelising compilers; Intel multicore machine; automatic data layout transformation; code mapping; code restructuring; compilers; data access patterns; data layout optimization; data locality optimization strategy; memory layout; multicore architecture; multicore platforms; multithreaded applications; on-chip cache topology; operating systems; parallel computation; Argon; Arrays; Instruction sets; Layout; Multicore processing; Optimization; Vectors; cache hierarchy-aware; data layout transformation; multicore;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Architectures and Compilation Techniques (PACT), 2011 International Conference on
Conference_Location :
Galveston, TX
ISSN :
1089-795X
Print_ISBN :
978-1-4577-1794-9
Type :
conf
DOI :
10.1109/PACT.2011.20
Filename :
6113796
Link To Document :
بازگشت