DocumentCode :
2483123
Title :
Failure handling in an optimized two-safe approach to maintaining primary-backup systems
Author :
Hu, Kexiang ; Mehrotra, Sharad ; Kaplan, Simon
Author_Institution :
Dept. of Comput. Sci., Illinois Univ., Urbana, IL, USA
fYear :
1998
fDate :
20-23 Oct 1998
Firstpage :
161
Lastpage :
167
Abstract :
In a primary backup database system, transaction processing takes place at the primary and the log records generated are propagated to the backup which uses them to reconstruct the database state at the primary. If the primary fails, the backup takes over to provide continued service. Most existing designs of primary backup database systems have concentrated on techniques to tolerate complete failures in which the entire primary fails, say due to a disaster. In multiprocessor environments, where the primary and the backup databases are partitioned across multiple computers, a more common case is a partial failure in which some database partitions fail but the system as a whole survives. Existing approaches either ignore partial failures, or require the failed database partition to be unavailable. We explore a design of the primary backup database system that uses the backup not only for disaster protection, but also for continued availability during partial failures. The approach is developed in the context of the improved optimized 2-safe strategy to transmitting logs from the primary to the backup, introduced by K. Hu et al. (1997), which combines the best features of the previously developed 1-safe and 2-safe strategies
Keywords :
back-up procedures; concurrency control; disasters; multiprocessing systems; software fault tolerance; transaction processing; backup databases; complete failures; continued availability; database partitions; database state; disaster protection; failure handling; log records; multiple computers; multiprocessor environments; optimized 2-safe strategy; optimized two-safe approach; partial failure; primary backup database system; primary backup systems maintenance; transaction processing; Computer science; Database systems; Delay effects; Earthquakes; Information technology; Propagation losses; Protection; Standby generators; Throughput; Transaction databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Reliable Distributed Systems, 1998. Proceedings. Seventeenth IEEE Symposium on
Conference_Location :
West Lafayette, IN
ISSN :
1060-9857
Print_ISBN :
0-8186-9218-9
Type :
conf
DOI :
10.1109/RELDIS.1998.740488
Filename :
740488
Link To Document :
بازگشت