DocumentCode
228710
Title
Scaling MapReduce Vertically and Horizontally
Author
El-Helw, Ismail ; Hofman, Rutger ; Bal, Henri E.
Author_Institution
Dept. of Comput. Sci., Vrije Univ. Amsterdam, Amsterdam, Netherlands
fYear
2014
fDate
16-21 Nov. 2014
Firstpage
525
Lastpage
535
Abstract
Glass wing is a MapReduce framework that uses OpenCL to exploit multi-core CPUs and accelerators. However, compute device capabilities may vary significantly and require targeted optimization. Similarly, the availability of resources such as memory, storage and interconnects can severely impact overall job performance. In this paper, we present and analyze how MapReduce applications can improve their horizontal and vertical scalability by using a well controlled mixture of coarse- and fine-grained parallelism. Specifically, we discuss the Glass wing pipeline and its ability to overlap computation, communication, memory transfers and disk access. Additionally, we show how Glass wing can adapt to the distinct capabilities of a variety of compute devices by employing fine-grained parallelism. We experimentally evaluated the performance of five MapReduce applications and show that Glass wing outperforms Hadoop on a 64-node multi-core CPU cluster by factors between 1.2 and 4, and factors from 20 to 30 on a 23-node GPU cluster. Similarly, we show that Glass wing is at least 1.5 times faster than GPMR on the GPU cluster.
Keywords
data handling; graphics processing units; multiprocessing systems; parallel processing; pipeline processing; GPMR; GPU cluster; Glasswing pipeline; Hadoop; MapReduce applications; MapReduce framework; OpenCL; accelerators; coarse-grained parallelism; disk access; fine-grained parallelism; horizontal MapReduce scaling; horizontal scalability; multicore CPU; multicore CPU cluster; vertical MapReduce scaling; vertical scalability; Graphics processing units; Instruction sets; Kernel; Parallel processing; Performance evaluation; Pipelines; Scalability; Heterogeneous; MapReduce; OpenCL; Scalability;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis, SC14: International Conference for
Conference_Location
New Orleans, LA
Print_ISBN
978-1-4799-5499-5
Type
conf
DOI
10.1109/SC.2014.48
Filename
7013030
Link To Document