• DocumentCode
    1998707
  • Title

    Compiler-Based Data Prefetching and Streaming Non-temporal Store Generation for the Intel(R) Xeon Phi(TM) Coprocessor

  • Author

    Krishnaiyer, Rakesh ; Kultursay, Emre ; Chawla, Pallvi ; Preis, Sebastian ; Zvezdin, Anatoly ; Saito, Hiroshi

  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    1575
  • Lastpage
    1586
  • Abstract
    The Intel® Xeon Phi™ coprocessor has software prefetching instructions to hide memory latencies and special store instructions to save bandwidth on streaming non-temporal store operations. In this work, we provide details on compiler-based generation of these instructions and evaluate their impact on the performance of the Intel® Xeon Phi™ coprocessor using a wide range of parallel applications with different characteristics. Our results show that the Intel® Composer XE 2013 compiler can make effective use of these mechanisms to achieve significant performance improvements.
  • Keywords
    coprocessors; parallel processing; performance evaluation; program compilers; storage management; Intel Composer XE 2013 compiler; Intel Xeon Phi coprocessor; bandwidth saving; compiler-based instruction generation; compiler-based software prefetching instructions; memory latencies; nontemporal store generation operation streaming; parallel applications; performance evaluation; Bandwidth; Coprocessors; Hardware; Prefetching; Vectors; Intel Xeon Phi; compiler; coprocessor; non-temporal store; performance; prefetching;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
  • Conference_Location
    Cambridge, MA
  • Print_ISBN
    978-0-7695-4979-8
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2013.231
  • Filename
    6651054