• DocumentCode
    2897902
  • Title

    A fault-tolerant directory-based cache coherence protocol for CMP architectures

  • Author

    Fernández-Pascual, Ricardo ; García, José M. ; Acacio, Manuel E. ; Duato, Jose

  • Author_Institution
    Univ. de Murcia, Murcia
  • fYear
    2008
  • fDate
    24-27 June 2008
  • Firstpage
    267
  • Lastpage
    276
  • Abstract
    Current technology trends of increased scale of integration are pushing CMOS technology into the deep-submicron domain, enabling the creation of chips with a significantly greater number of transistors but also more prone to transient failures. Hence, computer architects will have to consider reliability as a prime concern for future chip-multiprocessor designs (CMPs). Since the interconnection network of future CMPs will use a significant portion of the chip real state, it will be especially affected by transient failures. We propose to deal with this kind of failures at the level of the cache coherence protocol instead of ensuring the reliability of the network itself. Particularly, we have extended a directory-based cache coherence protocol to ensure correct program semantics even in presence of transient failures in the interconnection network. Additionally, we show that our proposal has virtually no impact on execution time with respect to a non fault-tolerant protocol, and just entails modest hardware and network traffic overhead.
  • Keywords
    CMOS integrated circuits; computer architecture; fault tolerant computing; CMOS technology; CMP architectures; chip-multiprocessor designs; deep-submicron domain; directory-based cache coherence protocol; fault-tolerant directory-based cache coherence protocol; interconnection network; program semantics; transient failures; CMOS technology; Coherence; Computer architecture; Computer network reliability; Fault tolerance; Hardware; Multiprocessor interconnection networks; Proposals; Protocols; Telecommunication traffic;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Dependable Systems and Networks With FTCS and DCC, 2008. DSN 2008. IEEE International Conference on
  • Conference_Location
    Anchorage, AK
  • Print_ISBN
    978-1-4244-2397-2
  • Electronic_ISBN
    978-1-4244-2398-9
  • Type

    conf

  • DOI
    10.1109/DSN.2008.4630095
  • Filename
    4630095