Title :
Data Remapping for Static NUCA in Degradable Chip Multiprocessors
Author :
Ying Wang ; Lei Zhang ; Yin-He Han ; Hua-Wei Li ; Xiaowei Li
Author_Institution :
State Key Lab. of Comput. Archit., Inst. of Comput. Technol., Beijing, China
Abstract :
In chip multiprocessors (CMPs), nonuniform cache architecture (NUCA) is often employed to organize last-level cache (LLC) banks through network-on-chip (NoC). Because of the shrinking feature size and unstable operating environment, severe reliability problems unavoidably emerge and cause frequent on-chip component (e.g., cores, cache banks, routers) failures. Typical fault-tolerant CMPs should possess the feature of graceful degradation and function normally with deactivated tiles. However, for CMPs adopting static NUCA, certain physical address areas will become inaccessible when cache banks in a CMP node are isolated from the system. To protect cache from such threats induced by either online or offline faults, we survey several potential solutions and propose the utility-driven node remapping technique by reusing the resources in NoC. In our NoC-assisted remapping scheme, cache accesses to isolated banks are so redirected that cache space contention are successfully balanced and relieved in shared-LLC, thus ensuring the least performance penalty caused by fault isolation. Our experimental results show significant performance improvement over conventional resizing approaches such as set reduction.
Keywords :
cache storage; fault diagnosis; fault tolerance; microprocessor chips; network-on-chip; LLC banks; NoC; cache banks; cache space contention; data remapping; degradable chip multiprocessors; fault isolation; fault-tolerant CMP; last-level cache; network-on-chip; nonuniform cache architecture; static NUCA; utility-driven node remapping technique; Circuit faults; Decoding; Fault tolerance; Fault tolerant systems; System-on-chip; Chip multiprocessor (CMP); fault tolerant; network-on-chip (NoC); nonuniform cache architecture (NUCA); nonuniform cache architecture (NUCA).;
Journal_Title :
Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
DOI :
10.1109/TVLSI.2014.2325933