Title :
A Unified Programming Model for Intra- and Inter-Node Offloading on Xeon Phi Clusters
Author :
Noack, Matthias ; Wende, Florian ; Steinke, Thomas ; Cordes, Frank
Author_Institution :
Zuse Inst. Berlin, Berlin-Dahlem, Germany
Abstract :
Standard offload programming models for the Xeon Phi, e.g., Intel LEO and OpenMP 4.0, are restricted to a single compute node and hence to a limited number of coprocessors. Scaling applications across a Xeon Phi cluster or supercomputer thus requires hybrid programming approaches, usually MPI+X. In this work, we present a framework based on heterogeneous active messages (HAM-Offload) that provides the means to offload work to local and remote (co)processors using a unified offload API. Since HAM-Offload provides primitives similar to those of current local offload frameworks, existing applications can be easily ported to overcome the single-node limitation while keeping the convenient offload programming model. We demonstrate the effectiveness of the framework by using it to enable a real-world application from the field of molecular dynamics to use multiple local and remote Xeon Phis. The evaluation shows good scaling behavior. Compared with LEO, performance is equal for large offloads and significantly better for small offloads.
Keywords :
application program interfaces; coprocessors; message passing; parallel machines; HAM-offload; Intel LEO; MPI+X; OpenMP 4.0; Xeon Phi clusters; compute node; coprocessors; heterogeneous active messages; hybrid programming approaches; inter-node offloading; intra-node offloading; molecular dynamics; scaling applications; scaling behavior; standard offload programming models; supercomputer; unified offload API; unified programming model; Computational modeling; Coprocessors; Data transfer; Libraries; Low earth orbit satellites; Performance evaluation; Programming;
Conference_Title :
High Performance Computing, Networking, Storage and Analysis, SC14: International Conference for
Conference_Location :
New Orleans, LA
Print_ISBN :
978-1-4799-5499-5