Title :
Efficient monitoring to detect wireless channel failures for MPI programs
Author :
Macías, Elsa M. ; Suárez, Alvaro ; Sunderam, Vaidy
Author_Institution :
Dept. de Ingenieria Telematica, Univ. of Las Palmas de Gran Canaria, Spain
Abstract :
In the last few years the use of wireless technology has increased by leaps and bounds and as a result powerful portable computers with wireless cards are viable nodes in parallel distributed computing. In this scenario it is natural to consider the possibility of frequent failures in the wireless channel. In MPI programs, such wireless network behavior is reflected as communication failure. Although the MPI standard does not handle failures, there are some projects that address this issue. To the best of our knowledge there is no previous work that presents a practical solution for fault-handling in MPI programs that run on wireless environments. We present a mechanism at the application level, that combined with wireless network monitoring software detects these failures and warns MPI applications to enable them to take appropriate action.
Keywords :
application program interfaces; message passing; parallel processing; portable computers; system monitoring; telecommunication channels; wireless LAN; MPI programs; parallel distributed computing; portable computers; wireless cards; wireless channel failure; wireless network behavior; wireless network monitoring software; Application software; Computer networks; Computerized monitoring; Concurrent computing; Condition monitoring; Distributed computing; Middleware; Parallel processing; Wireless communication; Wireless networks;
Conference_Titel :
Parallel, Distributed and Network-Based Processing, 2004. Proceedings. 12th Euromicro Conference on
Print_ISBN :
0-7695-2083-9
DOI :
10.1109/EMPDP.2004.1271469