• DocumentCode
    1961289
  • Title

    High Performance Fault-Tolerant Routing Algorithm for NoC-Based Many-Core Systems

  • Author

    Ebrahimi, Mojtaba ; Daneshtalab, Masoud ; Plosila, Juha

  • Author_Institution
    Dept. of Inf. Technol., Univ. of Turku, Turku, Finland
  • fYear
    2013
  • fDate
    Feb. 27 2013-March 1 2013
  • Firstpage
    462
  • Lastpage
    469
  • Abstract
    Networks-on-Chip (NoCs) has become a promising approach for the on-chip communication infrastructure of many-core Systems-on-Chip (SoCs). Faults may occur in the NoC both at the router and link level. There are many fault-tolerant approaches presented both in the off-chip and on-chip networks. Some approaches disable some healthy components in order to form a specific shape and others not. Regardless of all varieties, there has always been a common assumption among them. Most of all traditional fault-tolerant methods are based on rerouting packets around a faulty node or region. These approaches affect the performance significantly not only by taking longer paths but also by creating hotspot around a fault. The focus of this paper is to maintain the performance of NoC in the presence of faults. The presented method takes advantage of a fully adaptive routing algorithm using one and two virtual channels along the X and Y dimensions. This method is able to tolerate all cases of one-faulty node without losing the performance of NoC. According to the experimental results, this presented fault-tolerant routing algorithm is able to support up to six faulty nodes in the 8×8 mesh network by up to 98% reliability.
  • Keywords
    fault tolerant computing; multiprocessing systems; network routing; network-on-chip; performance evaluation; NoC performance; NoC-based many-core systems; SoC; adaptive routing algorithm; high performance fault tolerant routing algorithm; link level faults; network-on-chip; off-chip networks; on-chip communication infrastructure; router Faults; system-on-chip; virtual channels; Fault tolerance; Fault tolerant systems; Routing; Shape; Switches; System recovery; System-on-chip; Fully adaptive algorithms; Networks-on-Chip; fault-tolerant approaches; the shortest paths;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel, Distributed and Network-Based Processing (PDP), 2013 21st Euromicro International Conference on
  • Conference_Location
    Belfast
  • ISSN
    1066-6192
  • Print_ISBN
    978-1-4673-5321-2
  • Electronic_ISBN
    1066-6192
  • Type

    conf

  • DOI
    10.1109/PDP.2013.75
  • Filename
    6498591