DocumentCode
3089697
Title
Coordinated versus Uncoordinated Checkpoint Recovery for Network-on-Chip Based Systems
Author
Rusu, Claudia ; Grecu, Cristian ; Anghel, Lorena
Author_Institution
CNRS-UJF-INPG, Grenoble
fYear
2008
fDate
23-25 Jan. 2008
Firstpage
32
Lastpage
37
Abstract
This paper presents and compares two failure recovery schemes developed for multi-core systems-on- chip that use network-on-chip communication infrastructures. The failure recovery methods are aimed towards fast recovery from system or application failures, when global reset is the last resort to recover a failed system. The first method uses coordinated checkpointing, while the second is based on uncoordinated checkpointing and message logging. Their effectiveness and overhead are evaluated and compared, under different application traffic loads and failure rates.
Keywords
checkpointing; integrated circuit interconnections; integrated circuit reliability; integrated circuit testing; network-on-chip; communication infrastructure; coordinated checkpointing; failure recovery scheme; message logging; multicore systems-on-chip; network-on-chip based systems; on-chip interconnects; traffic loads; Checkpointing; Electronic equipment testing; Error correction; Fault tolerant systems; Laboratories; Network-on-a-chip; Power system interconnection; Protocols; Routing; System testing; checkpoint; failure rate; fault tolerance; message log; network-on-chip; recovery; rollback; traffic load;
fLanguage
English
Publisher
ieee
Conference_Titel
Electronic Design, Test and Applications, 2008. DELTA 2008. 4th IEEE International Symposium on
Conference_Location
Hong Kong
Print_ISBN
978-0-7695-3110-6
Type
conf
DOI
10.1109/DELTA.2008.75
Filename
4459505
Link To Document