• DocumentCode
    1513691
  • Title

    Dynamic Transfer of Computation to Processor Cache for Yield and Reliability Improvement

  • Author

    Paul, Somnath ; Bhunia, Swarup

  • Author_Institution
    Dept. of EECS, Case Western Reserve Univ., Cleveland, OH, USA
  • Volume
    19
  • Issue
    8
  • fYear
    2011
  • Firstpage
    1368
  • Lastpage
    1379
  • Abstract
    VLSI systems in the nanometer regime suffer from high defect rates and large parametric variations that lead to yield loss as well as reduced reliability of operation. An architectural framework that ensures proper system operation when few functional units are defective or unreliable under process-induced or temporal parametric variations can be effective in improving manufacturing yield and overall system reliability. In this paper, we propose a novel memory-based computational framework that exploits the on-chip memory to perform computation on demand using a lookup table (LUT)-based approach. The framework achieves reliable operation by transferring activity to embedded memory of a processor from a defective or unreliable functional unit. This allows the die to run at a reduced (but acceptable) performance level instead of being completely discarded due to unit failure (in case of defective functional unit) or being throttled (in case of temporal parameter variations, e.g., temperature induced variations). We note that although the worst-case latency of memory based computation can be considerably higher than regular operation latency, the average latency is only modestly higher due to the abundance of narrow-width operands. Furthermore, the operands for a specific instruction (e.g., integer add, multiply, or floating point add) experience high locality of reference and thus require loading only part of the LUTs in the cache. Simulation results for a set of benchmark applications show that the proposed scheme can significantly improve yield and reliability at the cost of only a small loss in performance (on an average 0.8%) and 10 × less area overhead compared to hardware duplication based defect tolerance approach.
  • Keywords
    VLSI; microprocessor chips; reliability; VLSI system; architectural framework; dynamic transfer; hardware duplication based defect tolerance; lookup table; manufacturing yield; memory based computation; memory-based computational framework; nanometer regime; narrow-width operands; on-chip memory; overall system reliability; processor cache; proper system operation; temporal parametric variation; unreliable functional unit; worst-case latency; yield and reliability improvement; CMOS technology; Computational modeling; Costs; Logic; Manufacturing processes; Propagation delay; Reliability; Table lookup; Temperature; Thermal management; Activity transfer; memory-based computation; reliability; thermal management; yield;
  • fLanguage
    English
  • Journal_Title
    Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-8210
  • Type

    jour

  • DOI
    10.1109/TVLSI.2010.2049389
  • Filename
    5483140