DocumentCode
1639432
Title
Synthesizing Byzantine Fault-Tolerant Grid Application Wrapper Services
Author
Hofer, Jürgen ; Fahringer, Thomas
Author_Institution
Distrib. & Parallel Syst. Group, Innsbruck Univ., Innsbruck
fYear
2008
Firstpage
467
Lastpage
474
Abstract
The grid is inherently unreliable due to its geographical dispersion, heterogeneity and the involvement of multiple administrative domains. The most general case of failures are so-called Byzantine failures where no assumptions about the behavior of faulty components can be made. In this paper a novel system is described that allows to diagnose and tolerate byzantine faults based on service replication. We suggest, briefly describe and compare two fail-stop and two byzantine fault tolerance algorithms. Given that many scientific larger-scale grid applications have complex outputs the comparison of replica results as needed to implement byzantine fault tolerance becomes a non-trivial task. Therefore we include an automation mechanism based on a generic description language and code generation for this particualar problem. Our approach has been implemented as extension to the Otho Toolkit, a system that synthesizes tailor-made wrapper services for a given application, grid environment and resource. An analysis of performance and overheads for three real-world applications completes our work.
Keywords
grid computing; software fault tolerance; Otho toolkit; byzantine failures; byzantine fault tolerance algorithms; generic description language; grid applications; service replication; tailor-made wrapper services; Application software; Automation; Computer crashes; Computer science; Concurrent computing; Control system synthesis; Distributed computing; Fault tolerance; Fault tolerant systems; Grid computing; Byzantine Fault Tolerance; Fault Tolerance; Grid; HPC;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing and the Grid, 2008. CCGRID '08. 8th IEEE International Symposium on
Conference_Location
Lyon
Print_ISBN
978-0-7695-3156-4
Electronic_ISBN
978-0-7695-3156-4
Type
conf
DOI
10.1109/CCGRID.2008.26
Filename
4534251
Link To Document