• DocumentCode
    2234701
  • Title

    Micro-checkpointing: checkpointing for multithreaded applications

  • Author

    Whisnant, K. ; Kalbarczyk, Z. ; Iyer, R.K.

  • Author_Institution
    Center for Reliable & High Performance Comput., Illinois Univ., Urbana, IL, USA
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    3
  • Lastpage
    8
  • Abstract
    In this paper we introduce an efficient technique for checkpointing multithreaded applications. Our approach makes use of processes constructed around the ARMOR (Adaptive Reconfigurable Mobile Objects of Reliability) paradigm implemented in our Chameleon testbed. ARMOR processes are composed of disjoint elements (objects) with controlled manipulation of element state. These characteristics of ARMORS allow the process state to be collected during runtime in an efficient manner and saved to disk when necessary. We call this approach micro-checkpointing. We demonstrate micro-checkpointing in the Chameleon testbed, an environment for developing reliable distributed applications. Our results show that the overhead ranges from between 39% to 141% with an aggressive checkpointing policy, depending upon the degree to which the process conforms to our ARMOR paradigm
  • Keywords
    error detection; multi-threading; program testing; software fault tolerance; software performance evaluation; ARMOR paradigm; Chameleon testbed; adaptive reconfigurable mobile objects of reliability; checkpointing; distributed applications development environment; micro-checkpointing; multithreaded applications; Application software; Checkpointing; Concurrent computing; Electrical capacitance tomography; Fault tolerance; Postal services; Read only memory; Runtime; Testing; Yarn;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    On-Line Testing Workshop, 2000. Proceedings. 6th IEEE International
  • Conference_Location
    Palma de Mallorca
  • Print_ISBN
    0-7695-0646-1
  • Type

    conf

  • DOI
    10.1109/OLT.2000.856604
  • Filename
    856604