DocumentCode :
1998707
Title :
Compiler-Based Data Prefetching and Streaming Non-temporal Store Generation for the Intel(R) Xeon Phi(TM) Coprocessor
Author :
Krishnaiyer, Rakesh ; Kultursay, Emre ; Chawla, Pallvi ; Preis, Sebastian ; Zvezdin, Anatoly ; Saito, Hiroshi
fYear :
2013
fDate :
20-24 May 2013
Firstpage :
1575
Lastpage :
1586
Abstract :
The Intel® Xeon Phi™ coprocessor has software prefetching instructions to hide memory latencies and special store instructions to save bandwidth on streaming non-temporal store operations. In this work, we provide details on compiler-based generation of these instructions and evaluate their impact on the performance of the Intel® Xeon Phi™ coprocessor using a wide range of parallel applications with different characteristics. Our results show that the Intel® Composer XE 2013 compiler can make effective use of these mechanisms to achieve significant performance improvements.
Keywords :
coprocessors; parallel processing; performance evaluation; program compilers; storage management; Intel Composer XE 2013 compiler; Intel Xeon Phi coprocessor; bandwidth saving; compiler-based instruction generation; compiler-based software prefetching instructions; memory latencies; nontemporal store generation operation streaming; parallel applications; performance evaluation; Bandwidth; Coprocessors; Hardware; Prefetching; Vectors; Intel Xeon Phi; compiler; coprocessor; non-temporal store; performance; prefetching;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
Conference_Location :
Cambridge, MA
Print_ISBN :
978-0-7695-4979-8
Type :
conf
DOI :
10.1109/IPDPSW.2013.231
Filename :
6651054
Link To Document :
بازگشت