Title :
An Implementation of the Message Passing Interface over an Adaptive Peer-to-Peer Network
Author :
Ni, Lei ; Harwood, Aaron
Author_Institution :
Dept. of Comput. Sci. & Software Eng., Melbourne Univ., Vic.
Abstract :
Achieving high performance parallel computing requires both a large scale and reliable system. We describe our design and implementation of the message passing interface, called MPICH-OPeN, for parallel computing over a peer-to-peer network to address this challenge. Our implementation uses the Condor standalone checkpoint library and the Chandy-Lamport algorithm, for reliability, with extensions to make it decentralized. We use the OPeN architecture with an adaptive peer-to-peer protocol that caches connections between peers according to communication requirements of the parallel processes. We used PlanetLab to compare the performance of our implementation to MPICH-P4 and to measure the impact of dynamic peers on parallel program execution
Keywords :
application program interfaces; message passing; open systems; parallel programming; peer-to-peer computing; protocols; Chandy-Lamport algorithm; Condor standalone checkpoint library; MPICH-OPeN architecture; MPICH-P4; PlanetLab; adaptive peer-to-peer network; adaptive peer-to-peer protocol; high performance parallel computing; large scale reliable system; message passing interface; parallel program execution; Adaptive systems; Computer architecture; Extraterrestrial measurements; Large-scale systems; Libraries; Message passing; Parallel processing; Peer to peer computing; Protocols; Telecommunication network reliability;
Conference_Titel :
High Performance Distributed Computing, 2006 15th IEEE International Symposium on
Conference_Location :
Paris
Print_ISBN :
1-4244-0307-3
DOI :
10.1109/HPDC.2006.1652189