• DocumentCode
    1639432
  • Title

    Synthesizing Byzantine Fault-Tolerant Grid Application Wrapper Services

  • Author

    Hofer, Jürgen ; Fahringer, Thomas

  • Author_Institution
    Distrib. & Parallel Syst. Group, Innsbruck Univ., Innsbruck
  • fYear
    2008
  • Firstpage
    467
  • Lastpage
    474
  • Abstract
    The grid is inherently unreliable due to its geographical dispersion, heterogeneity and the involvement of multiple administrative domains. The most general class of failures is the so-called Byzantine failure, where no assumptions can be made about the behavior of faulty components. This paper describes a novel system that diagnoses and tolerates Byzantine faults based on service replication. We suggest, briefly describe and compare two fail-stop and two Byzantine fault tolerance algorithms. Because many larger-scale scientific grid applications have complex outputs, comparing replica results, as needed to implement Byzantine fault tolerance, becomes a non-trivial task. We therefore include an automation mechanism based on a generic description language and code generation for this particular problem. Our approach has been implemented as an extension to the Otho Toolkit, a system that synthesizes tailor-made wrapper services for a given application, grid environment and resource. An analysis of performance and overheads for three real-world applications completes our work.
  • Keywords
    grid computing; software fault tolerance; Otho toolkit; byzantine failures; byzantine fault tolerance algorithms; generic description language; grid applications; service replication; tailor-made wrapper services; Application software; Automation; Computer crashes; Computer science; Concurrent computing; Control system synthesis; Distributed computing; Fault tolerance; Fault tolerant systems; Grid computing; Byzantine Fault Tolerance; Fault Tolerance; Grid; HPC;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Cluster Computing and the Grid, 2008. CCGRID '08. 8th IEEE International Symposium on
  • Conference_Location
    Lyon
  • Print_ISBN
    978-0-7695-3156-4
  • Electronic_ISBN
    978-0-7695-3156-4
  • Type

    conf

  • DOI
    10.1109/CCGRID.2008.26
  • Filename
    4534251