DocumentCode
2806534
Title
A Javaspace-Based Framework for Efficient Fault-Tolerant Master-Worker Distributed Applications
Author
Galtier, Virginie ; Makassikis, Constantinos ; Vialle, Stephane
Author_Institution
SUPELEC, UMI, France
fYear
2011
fDate
9-11 Feb. 2011
Firstpage
272
Lastpage
276
Abstract
We propose a framework built around a Java Space to ease the development of bag-of-tasks applications. The framework may optionally and automatically tolerate transient crash failures occurring on any of the distributed elements. It relies on check pointing and underlying middleware mechanisms to do so. To further improve check pointing efficiency, both in size and frequency, the programmer can introduce intermediate user-defined checkpoint data and code within the task processing program. The framework used without fault tolerance accelerates application development, does not introduce runtime overhead and yields to expected speedup. When enabling fault tolerance, our framework allows, despite failures, correct completion of applications with limited runtime and data storage overheads. Experiments run with up to 128 workers study the impact of some user-related and implementation-related on overall performance, and reveal good performances for classical Java Space-based master-worker application profiles.
Keywords
Java; fault tolerant computing; middleware; system recovery; Javaspace based framework; automatically tolerate transient crash failure; check pointing efficiency improvement; distributed element; fault tolerant master worker distributed application; java space based master worker application profile; middleware mechanism; task processing program; user defined checkpoint data; Checkpointing; Fault tolerance; Fault tolerant systems; Java; Middleware; Runtime; Transient analysis; checkpointing; distributed fault tolerance; framework; master-worker; user-framework-middleware cooperation;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel, Distributed and Network-Based Processing (PDP), 2011 19th Euromicro International Conference on
Conference_Location
Ayia Napa
ISSN
1066-6192
Print_ISBN
978-1-4244-9682-2
Type
conf
DOI
10.1109/PDP.2011.82
Filename
5738998
Link To Document