DocumentCode
2234701
Title
Micro-checkpointing: checkpointing for multithreaded applications
Author
Whisnant, K. ; Kalbarczyk, Z. ; Iyer, R.K.
Author_Institution
Center for Reliable & High Performance Comput., Illinois Univ., Urbana, IL, USA
fYear
2000
fDate
2000
Firstpage
3
Lastpage
8
Abstract
In this paper we introduce an efficient technique for checkpointing multithreaded applications. Our approach makes use of processes constructed around the ARMOR (Adaptive Reconfigurable Mobile Objects of Reliability) paradigm implemented in our Chameleon testbed. ARMOR processes are composed of disjoint elements (objects) with controlled manipulation of element state. These characteristics of ARMORS allow the process state to be collected during runtime in an efficient manner and saved to disk when necessary. We call this approach micro-checkpointing. We demonstrate micro-checkpointing in the Chameleon testbed, an environment for developing reliable distributed applications. Our results show that the overhead ranges from between 39% to 141% with an aggressive checkpointing policy, depending upon the degree to which the process conforms to our ARMOR paradigm
Keywords
error detection; multi-threading; program testing; software fault tolerance; software performance evaluation; ARMOR paradigm; Chameleon testbed; adaptive reconfigurable mobile objects of reliability; checkpointing; distributed applications development environment; micro-checkpointing; multithreaded applications; Application software; Checkpointing; Concurrent computing; Electrical capacitance tomography; Fault tolerance; Postal services; Read only memory; Runtime; Testing; Yarn;
fLanguage
English
Publisher
ieee
Conference_Titel
On-Line Testing Workshop, 2000. Proceedings. 6th IEEE International
Conference_Location
Palma de Mallorca
Print_ISBN
0-7695-0646-1
Type
conf
DOI
10.1109/OLT.2000.856604
Filename
856604
Link To Document