• DocumentCode
    1655789
  • Title

    Communication pattern based checkpointing coordination for fault-tolerant distributed computing systems

  • Author

    Park, Taesoon ; Yeom, Heon Y.

  • Author_Institution
    Dept. of Inf. Sci., Sejong Univ., Seoul, South Korea
  • fYear
    1998
  • Firstpage
    559
  • Lastpage
    562
  • Abstract
    This paper presents a new checkpointing coordination scheme which utilizes the communication pattern of the cooperating processes. In the proposed scheme, the checkpointing is coordinated for the limited number of processes based on the information regarding the communication pattern of the target program. Unlike the previous solutions which do not utilize the communication pattern, it is possible to reduce the coordination effort as well as the checkpointing frequency. Extensive simulation has been performed to evaluate the performance of the proposed scheme and we concluded that the proposed scheme significantly reduces the checkpointing overhead compared with the loose coordination schemes
  • Keywords
    concurrency control; distributed algorithms; fault tolerant computing; checkpointing; checkpointing coordination; distributed computing; fault-tolerant; Checkpointing; Distributed computing; Fault tolerance; Fault tolerant systems; Frequency; Information science; Performance evaluation; Resumes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Networking, 1998. (ICOIN-12) Proceedings., Twelfth International Conference on
  • Conference_Location
    Tokyo
  • Print_ISBN
    0-8186-7225-0
  • Type

    conf

  • DOI
    10.1109/ICOIN.1998.648447
  • Filename
    648447