DocumentCode :
1783321
Title :
DataMPI: Extending MPI to Hadoop-Like Big Data Computing
Author :
Xiaoyi Lu ; Fan Liang ; Bing Wang ; Li Zha ; Zhiwei Xu
Author_Institution :
Inst. of Comput. Technol., Beijing, China
fYear :
2014
fDate :
19-23 May 2014
Firstpage :
829
Lastpage :
838
Abstract :
MPI has been widely used in High Performance Computing. In contrast, such efficient communication support is lacking in the field of Big Data Computing, where communication is realized by time consuming techniques such as HTTP/RPC. This paper takes a step in bridging these two fields by extending MPI to support Hadoop-like Big Data Computing jobs, where processing and communication of a large number of key-value pair instances are needed through distributed computation models such as MapReduce, Iteration, and Streaming. We abstract the characteristics of key-value communication patterns into a bipartite communication model, which reveals four distinctions from MPI: Dichotomic, Dynamic, Data-centric, and Diversified features. Utilizing this model, we propose the specification of a minimalistic extension to MPI. An open source communication library, DataMPI, is developed to implement this specification. Performance experiments show that DataMPI has significant advantages in performance and flexibility, while maintaining high productivity, scalability, and fault tolerance of Hadoop.
Keywords :
Big Data; application program interfaces; message passing; parallel processing; public domain software; DataMPI; Hadoop-like big data computing; bipartite communication model; data-centric features; dichotomic features; distributed computation models; diversified features; dynamic features; high performance computing; key-value communication patterns; key-value pair instances; open source communication library; Analytical models; Bandwidth; Big data; Computational modeling; Data models; Libraries; Schedules; Big Data; DataMPI; Hadoop; MPI; MapReduce;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium, 2014 IEEE 28th International
Conference_Location :
Phoenix, AZ
ISSN :
1530-2075
Print_ISBN :
978-1-4799-3799-8
Type :
conf
DOI :
10.1109/IPDPS.2014.90
Filename :
6877314
Link To Document :
بازگشت