• DocumentCode
    3089697
  • Title

    Coordinated versus Uncoordinated Checkpoint Recovery for Network-on-Chip Based Systems

  • Author

    Rusu, Claudia ; Grecu, Cristian ; Anghel, Lorena

  • Author_Institution
    CNRS-UJF-INPG, Grenoble
  • fYear
    2008
  • fDate
    23-25 Jan. 2008
  • Firstpage
    32
  • Lastpage
    37
  • Abstract
    This paper presents and compares two failure recovery schemes developed for multi-core systems-on- chip that use network-on-chip communication infrastructures. The failure recovery methods are aimed towards fast recovery from system or application failures, when global reset is the last resort to recover a failed system. The first method uses coordinated checkpointing, while the second is based on uncoordinated checkpointing and message logging. Their effectiveness and overhead are evaluated and compared, under different application traffic loads and failure rates.
  • Keywords
    checkpointing; integrated circuit interconnections; integrated circuit reliability; integrated circuit testing; network-on-chip; communication infrastructure; coordinated checkpointing; failure recovery scheme; message logging; multicore systems-on-chip; network-on-chip based systems; on-chip interconnects; traffic loads; Checkpointing; Electronic equipment testing; Error correction; Fault tolerant systems; Laboratories; Network-on-a-chip; Power system interconnection; Protocols; Routing; System testing; checkpoint; failure rate; fault tolerance; message log; network-on-chip; recovery; rollback; traffic load;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electronic Design, Test and Applications, 2008. DELTA 2008. 4th IEEE International Symposium on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-0-7695-3110-6
  • Type

    conf

  • DOI
    10.1109/DELTA.2008.75
  • Filename
    4459505