DocumentCode
659414
Title
Optimizing the MapReduce framework on Intel Xeon Phi coprocessor
Author
Mian Lu ; Lei Zhang ; Huynh Phung Huynh ; Zhongliang Ong ; Yun Liang ; Bingsheng He ; Goh, Rick Siow Mong ; Huynh, Richard
fYear
2013
fDate
6-9 Oct. 2013
Firstpage
125
Lastpage
130
Abstract
MapReduce has become one of the most popular framework for building big-data applications. It was originally designed for distributed-computing, and has been extended to various hardware architectures, e.g., multi-core CPUs, GPUs and FPGAs. In this work, we develop the first MapReduce framework on the recently released Intel Xeon Phi coprocessor. We utilize advanced features of the Xeon Phi to achieve high performance. In order to take advantage of the SIMD vector processing units, we propose a vectorization friendly technique to assist the auto-vectorization as well as develop SIMD hash computation algorithms. Furthermore, we utilize MIMD hyper-threading to pipeline the map and reduce phases to improve the resource utilization. We also eliminate multiple local arrays but use low cost atomic operations on the global array for some applications, which can improve the thread scalability and data locality. We conduct comprehensive experiments to compare our optimized MapReduce framework with a state-of-the-art multi-core based MapReduce framework (Phoenix++). By evaluating six real-world applications, the experimental results show that our optimized framework is 1.2X to 38X faster than Phoenix++ for various applications on the Xeon Phi.
Keywords
coprocessors; multi-threading; parallel programming; FPGA; GPU; Intel Xeon Phi coprocessor; MIMD hyperthreading; MapReduce framework; Phoenix++ framework; SIMD vector processing units; atomic operations; big-data applications; central processing unit; data locality; distributed computing; field programmable gate array; graphics processing unit; hardware architectures; multicore CPU; single instruction multiple data; thread scalability; Arrays; Containers; Coprocessors; Hardware; Instruction sets; Multicore processing; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Big Data, 2013 IEEE International Conference on
Conference_Location
Silicon Valley, CA
Type
conf
DOI
10.1109/BigData.2013.6691563
Filename
6691563
Link To Document