• DocumentCode
    1960153
  • Title

    Heterogeneous Algorithmic Skeletons for Fast Flow with Seamless Coordination over Hybrid Architectures

  • Author

    Goli, M. ; Gonzalez-Velez, H.

  • Author_Institution
    IDEAS Res. Inst., Robert Gordon Univ., Aberdeen, UK
  • fYear
    2013
  • fDate
    Feb. 27 2013-March 1 2013
  • Firstpage
    148
  • Lastpage
    156
  • Abstract
    Algorithmic skeletons (`skeletons´) abstract commonly-used patterns of parallel computation, communication, and interaction. They provide top-down design composition and control inheritance throughout the whole structure. The efficient execution of skeletal applications on a heterogeneous environment has long been of interest to the research community. Arguably, executing a coarse-grained resource-intensive skeletal workloads ought to achieve higher resource utilisation and, ultimately, better job makespan on heterogeneous systems due to the structured parallelism model. This paper presents a heterogeneous OpenCL-based GPU back-end for FastFlow, a widely-used skeletal framework. Our back-end allows the user to easily write any arbitrary OpenCL code inside an heterogeneous algorithmic skeleton and seamlessly control the allocation of OpenCL kernel over the hybrid (CPU/GPU) architecture. Our performance evaluation indicate that a skeletal program which employs our back-end is around one order of magnitude faster than a skeletal parallel program using the traditional homogeneous FastFlow skeletons with the serial version of OpenCL code.
  • Keywords
    graphics processing units; multiprocessing systems; parallel architectures; resource allocation; OpenCL code; OpenCL kernel; heterogeneous OpenCL-based GPU backend; heterogeneous algorithmic skeleton; heterogeneous environment; homogeneous FastFlow skeleton; hybrid architecture; job makespan; parallel computation; resource utilisation; seamless coordination; skeletal workload execution; structured parallelism model; top-down design composition; Computer architecture; Graphics processing units; Hardware; Kernel; Pipelines; Resource management; Skeleton; Algorithmic Skeletons; GPU; OpenCL; Parallel Patterns; Parallel Programming; Structured Parallelism;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel, Distributed and Network-Based Processing (PDP), 2013 21st Euromicro International Conference on
  • Conference_Location
    Belfast
  • ISSN
    1066-6192
  • Print_ISBN
    978-1-4673-5321-2
  • Electronic_ISBN
    1066-6192
  • Type

    conf

  • DOI
    10.1109/PDP.2013.29
  • Filename
    6498546