• DocumentCode
    2535477
  • Title

    Rearchitecting MapReduce for Heterogeneous Multicore Processors with Explicitly Managed Memories

  • Author

    Papagiannis, Anastasios ; Nikolopoulos, Dimitrios S.

  • Author_Institution
    Inst. of Comput. Sci. (ICS), Found. for Res. & Technol. Hellas (FORTH), Heraklion, Greece
  • fYear
    2010
  • fDate
    13-16 Sept. 2010
  • Firstpage
    121
  • Lastpage
    130
  • Abstract
    This paper presents a new design and an implementation of the runtime system of MapReduce for heterogeneous multicore processors with explicitly managed local memories. We advance the state of the art in runtime support for MapReduce using five instruments: (1) A new multi-threaded, event-driven controller for task instantiation, task scheduling, synchronization, and bulk-synchronous execution of MapReduce stages. The controller improves utilization of control efficient cores, minimizes control overhead in the runtime system, and overlaps task instantiation with task scheduling on compute-efficient cores. (2) An implicit partitioning scheme which eliminates redundant memory copies. (3) An adaptive memory management scheme which combines efficient memory preallocation for applications with statically known output volume with dynamic allocation using runahead tasks for applications with statically unknown output volume. (4) An optimized quick-sort/merge-sort scheme which reduces the critical path length of merge-sort. (5) An optimized execution scheme which avoids redundant data transfers to and from local stores in applications that emit keys with the same value. Put together, these techniques accelerate representative MapReduce workloads by a factor of 1.81x (geometric mean) compared to a reference design that represents the state of the art.
  • Keywords
    multiprocessing systems; parallel programming; program processors; scheduling; storage management; MapReduce; dynamic allocation; event-driven controller; heterogeneous multicore processors; implicit partitioning scheme; memory management scheme; memory preallocation; merge-sort scheme; quick-sort scheme; runtime system; task instantiation; task scheduling; Instruction sets; Memory management; Multicore processing; Processor scheduling; Runtime; Heterogeneous multicore processors; MapReduce; Operating Systems; Parallel Programming Models; Resource management; Runtime systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing (ICPP), 2010 39th International Conference on
  • Conference_Location
    San Diego, CA
  • ISSN
    0190-3918
  • Print_ISBN
    978-1-4244-7913-9
  • Electronic_ISBN
    0190-3918
  • Type

    conf

  • DOI
    10.1109/ICPP.2010.21
  • Filename
    5599156