• DocumentCode
    3178686
  • Title

    Exploring performance-power tradeoffs in providing reliability for NoC-based MPSoCs

  • Author

    Zhao, Hui ; Kandemir, Mahmut ; Irwin, Mary Jane

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
  • fYear
    2011
  • fDate
    14-16 March 2011
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    Performance and power consumption are important challenges faced by Network-on-Chip (NoC) designers. The situation is exacerbated when error control techniques are employed to provide reliability, since such techniques can lead to extra power consumption and execution cycles. In many systems today, ECC codes are used for error detection. Once an error is detected, recovery schemes are invoked to correct it. In this paper, we focus on tuning error recovery schemes to explore performance, power and reliability tradeoffs. Previous reliability work targeting NoCs proposed two retransmission techniques to recover from errors: End-to-End retransmission and Hop-by-Hop retransmission. End-to-End retransmission can save power but can also incur longer delays for recovery by checking errors only at the destination. In comparison, Hop-by-Hop retransmission checks for errors at every router and has better performance at the expense of increased power overhead. We propose a novel retransmission scheme that employs feedback control theory to dynamically choose the time for error checking based on the performance requirements of the applications. Our scheme ensures that applications meet performance QoS and save power at the same time. Our experimental evaluation shows that, if a 10% slack in delay is allowed, our scheme can save as much as 80% of the power consumed by the underlying error control scheme.
  • Keywords
    circuit feedback; error detection; multiprocessing systems; network routing; network-on-chip; power aware computing; quality of service; ECC code; NoC-based MPSoC reliability; QoS; end-to-end retransmission technique; error control technique; error detection; error recovery scheme; feedback control theory; hop-by-hop retransmission technique; multiprocessor SoC; network-on-chip design; performance-power tradeoff; power consumption; router; Adaptive control; Delay; Error analysis; Error correction codes; Feedback control; Quality of service; Reliability; NoC; Performance; Power; QoS; Reliability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Quality Electronic Design (ISQED), 2011 12th International Symposium on
  • Conference_Location
    Santa Clara, CA
  • ISSN
    1948-3287
  • Print_ISBN
    978-1-61284-913-3
  • Type

    conf

  • DOI
    10.1109/ISQED.2011.5770773
  • Filename
    5770773