• DocumentCode
    1987519
  • Title

    Porting to the Intel Xeon Phi: Opportunities and Challenges

  • Author

    Rosales, Carlos

  • Author_Institution
    Texas Adv. Comput. Center, Univ. of Texas at Austin, Austin, TX, USA
  • fYear
    2013
  • fDate
    15-16 Aug. 2013
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    This work describes the challenges presented by porting code to the Intel Xeon Phi coprocessor, as well as opportunities for optimization and tuning. We use micro-benchmarks, code segments, assembly listings and application level results to illustrate the key issues in porting to the Xeon Phi coprocessor, always keeping in mind both portability and performance. While executing code on the Xeon Phi in native mode is fairly straightforward it can be a challenge to achieve good performance. The complexity of optimization increases as one introduces offload, distributed offload, or symmetric execution modes. We will initially focus on the fundamental issues that can prevent acceptable performance in native execution, and then address the key issues in data transfers due to either offloaded regions or MPI exchanges with the host CPU. Some of the issues are of a generic nature and affect any code using heterogeneous execution - PCIe bandwidth bottleneck -, and others are specific to the Xeon Phi and its software environment - Host/MIC MPI exchanges. We will also make an effort to indicate which issues are specific to this platform and which are of general applicability. In particular we will draw comparisons between the data management models in the Intel Xeon Phi and in the NVIDIA CUDA environment.
  • Keywords
    coprocessors; Intel Xeon Phi coprocessor; MPI exchanges; NVIDIA CUDA environment; PCIe bandwidth bottleneck; assembly listings; code segments; data management models; data transfers; distributed offload; heterogeneous execution; host CPU; native execution; optimization; portability; porting code; software environment; symmetric execution modes; tuning; Arrays; Bandwidth; Coprocessors; Hardware; Instruction sets; Microwave integrated circuits; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Extreme Scaling Workshop (XSW), 2013
  • Conference_Location
    Boulder, CO
  • Type

    conf

  • DOI
    10.1109/XSW.2013.5
  • Filename
    6805036