• DocumentCode
    2199916
  • Title

    Checkpointing with multicast communication

  • Author

    Lumpp, J.E., Jr

  • Author_Institution
    Dept. of Electr. Eng., Kentucky Univ., Lexington, KY
  • Volume
    4
  • fYear
    1998
  • fDate
    21-28 Mar 1998
  • Firstpage
    467
  • Abstract
    For long-running or large-scale distributed programs, the ability to provide software fault-tolerance via checkpointing is valuable. For scalable systems, multicast communication is becoming a predominant communication paradigm. While some aspects of consistency and channel state are the same for both unicast and multicast protocols, the implementation of checkpointing systems differ. This paper explores the problem of checkpointing in a multicast environment and introduces two checkpointing algorithms for such environments. The first algorithm is closely based on existing checkpointing algorithms. The second employs the multicast protocol to distribute checkpointing information efficiently
  • Keywords
    internetworking; multicast communication; software fault tolerance; transport protocols; checkpointing information; checkpointing systems; large-scale distributed programs; multicast communication; multicast protocol; scalable systems; software fault-tolerance; unicast protocols; Checkpointing; Ethernet networks; Fault tolerance; Hardware; Large-scale systems; Multicast algorithms; Multicast communication; Multicast protocols; Operating systems; Unicast;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Aerospace Conference, 1998 IEEE
  • Conference_Location
    Snowmass at Aspen, CO
  • ISSN
    1095-323X
  • Print_ISBN
    0-7803-4311-5
  • Type

    conf

  • DOI
    10.1109/AERO.1998.682213
  • Filename
    682213