• DocumentCode
    228710
  • Title

    Scaling MapReduce Vertically and Horizontally

  • Author

    El-Helw, Ismail ; Hofman, Rutger ; Bal, Henri E.

  • Author_Institution
    Dept. of Comput. Sci., Vrije Univ. Amsterdam, Amsterdam, Netherlands
  • fYear
    2014
  • fDate
    16-21 Nov. 2014
  • Firstpage
    525
  • Lastpage
    535
  • Abstract
    Glass wing is a MapReduce framework that uses OpenCL to exploit multi-core CPUs and accelerators. However, compute device capabilities may vary significantly and require targeted optimization. Similarly, the availability of resources such as memory, storage and interconnects can severely impact overall job performance. In this paper, we present and analyze how MapReduce applications can improve their horizontal and vertical scalability by using a well controlled mixture of coarse- and fine-grained parallelism. Specifically, we discuss the Glass wing pipeline and its ability to overlap computation, communication, memory transfers and disk access. Additionally, we show how Glass wing can adapt to the distinct capabilities of a variety of compute devices by employing fine-grained parallelism. We experimentally evaluated the performance of five MapReduce applications and show that Glass wing outperforms Hadoop on a 64-node multi-core CPU cluster by factors between 1.2 and 4, and factors from 20 to 30 on a 23-node GPU cluster. Similarly, we show that Glass wing is at least 1.5 times faster than GPMR on the GPU cluster.
  • Keywords
    data handling; graphics processing units; multiprocessing systems; parallel processing; pipeline processing; GPMR; GPU cluster; Glasswing pipeline; Hadoop; MapReduce applications; MapReduce framework; OpenCL; accelerators; coarse-grained parallelism; disk access; fine-grained parallelism; horizontal MapReduce scaling; horizontal scalability; multicore CPU; multicore CPU cluster; vertical MapReduce scaling; vertical scalability; Graphics processing units; Instruction sets; Kernel; Parallel processing; Performance evaluation; Pipelines; Scalability; Heterogeneous; MapReduce; OpenCL; Scalability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis, SC14: International Conference for
  • Conference_Location
    New Orleans, LA
  • Print_ISBN
    978-1-4799-5499-5
  • Type

    conf

  • DOI
    10.1109/SC.2014.48
  • Filename
    7013030