Regional Information Center for Science and Technology - Resource Centered Computing Delivering High Parallel Performance

DocumentCode :

167225

Title :

Resource Centered Computing Delivering High Parallel Performance

Author :

Gustedt, Jens ; Vialle, Stephane ; Mercier, Patrick

Author_Institution :

INRIA Nancy - Grand Est, Nancy, France

fYear :

2014

fDate :

19-23 May 2014

Firstpage :

Lastpage :

Abstract :

Modern parallel programming requires a combination of different paradigms, expertise and tuning that correspond to the different levels in today's hierarchical architectures. To cope with this inherent difficulty, ORWL (ordered read-write locks) presents a new paradigm and toolbox centered around local or remote resources, such as data, processors or accelerators. ORWL programmers describe their computation in terms of access to these resources during critical sections. Exclusive or shared access to the resources is granted through FIFOs with read-write semantics. ORWL partially replaces a classical runtime and offers a new API for resource-centric parallel programming. We successfully ran an ORWL benchmark application on different parallel architectures (a multicore CPU cluster, a NUMA machine, a CPU+GPU cluster). When processing large data we achieved scalability and performance similar to a reference code built on top of MPI+OpenMP+CUDA. The integration of optimized kernels of scientific computing libraries (ATLAS and cuBLAS) has been almost effortless, and we were able to increase performance by using both CPU and GPU cores simultaneously on our hybrid hierarchical cluster. We aim to make ORWL a new easy-to-use and efficient programming model and toolbox for parallel developers.
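
The abstract says that exclusive or shared access to a resource is granted through a FIFO with read-write semantics. The following is a minimal sketch of that general idea only, not the ORWL API: the class FifoRWLock and its methods request, granted, acquire and release are hypothetical names chosen for illustration. It assumes one queue per resource, where the request at the head is always granted and a read request is also granted when only read requests precede it.

```python
import threading
from collections import deque
from itertools import count


class FifoRWLock:
    """FIFO-ordered read-write lock for one resource (illustrative, not ORWL)."""

    def __init__(self):
        self._cond = threading.Condition()
        self._queue = deque()  # pending and active requests, oldest first
        self._ids = count()    # unique, increasing ticket numbers

    def request(self, exclusive):
        """Enqueue a request and return its ticket without blocking."""
        with self._cond:
            ticket = (next(self._ids), exclusive)
            self._queue.append(ticket)
            return ticket

    def _granted(self, ticket):
        # Walk from the head of the queue. The head is always granted.
        # A reader is granted if only readers precede it; a writer is
        # granted only when nothing precedes it.
        for queued in self._queue:
            if queued == ticket:
                return True
            if queued[1] or ticket[1]:
                return False
        return False  # ticket is not in the queue

    def granted(self, ticket):
        """Non-blocking check of whether the ticket currently holds access."""
        with self._cond:
            return self._granted(ticket)

    def acquire(self, ticket):
        """Block until the ticket is granted (start of the critical section)."""
        with self._cond:
            self._cond.wait_for(lambda: self._granted(ticket))

    def release(self, ticket):
        """Leave the queue (end of the critical section) and wake waiters."""
        with self._cond:
            self._queue.remove(ticket)
            self._cond.notify_all()
```

In this sketch a task would call request ahead of time to fix its position in the queue, then acquire and release around the critical section. Splitting request from acquire is a design choice made here so that the access order is decided when requests are posted rather than when threads happen to arrive.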

Keywords :

application program interfaces; graphics processing units; natural sciences computing; parallel architectures; parallel programming; API; CPU cores; GPU cores; MPI+OpenMP+CUDA; ORWL programmers; critical sections; high parallel performance; hybrid hierarchical cluster; local resources; ordered read-write locks; parallel architectures; parallel developers; read-write semantic; remote resources; resource centered computing; resource centric parallel programming; scientific computing libraries; Computational modeling; Computer architecture; Graphics processing units; Kernel; Parallel processing; Parallel programming; GPU; accelerators; clusters; experiments; performance; read-write locks; resource centered computing;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Parallel & Distributed Processing Symposium Workshops (IPDPSW), 2014 IEEE International

Conference_Location :

Phoenix, AZ

Print_ISBN :

978-1-4799-4117-9

Type :

conf

DOI :

10.1109/IPDPSW.2014.14

Filename :

6969373

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=167225