• DocumentCode
    1811381
  • Title

    Efficient fault tolerance: an approach to deal with transient faults in multiprocessor architectures

  • Author

    Bondavalli, Andrea ; Chiaradonna, Silvano ; Di Giandomenico, Felicita

  • Author_Institution
    Istituto CNUCE, CNR, Pisa, Italy
  • fYear
    1994
  • fDate
    19-22 Dec 1994
  • Firstpage
    354
  • Lastpage
    359
  • Abstract
    Dynamic error processing approaches are an important mechanism to increase the reliability in a multiprocessor system, while making efficient use of the available resources. To this end, dynamic error processing must be integrated with a fault treatment approach aiming at optimising resource utilisation. In this paper we propose a diagnosis approach that, accounting for transient faults, tries to remove units very cautiously and to balance between two conflicting requirements. The first is to avoid the removal of units that have experienced transient faults and can be still useful for the system and the other is to avoid to keep failed units whose usage may lead to a premature failure of the system. The proposed fault treatment approach is integrated with a mechanism for dynamic error processing in a complete fault tolerance strategy. Reliability analyses based on the Markov approach and an efficiency evaluation performed by simulation are carried out
  • Keywords
    Markov processes; fault tolerant computing; multiprocessing systems; Markov approach; diagnosis approach; dynamic error processing; error processing; fault tolerance; multiprocessor architectures; reliability; resource utilisation; simulation; transient faults; Analytical models; Bonding; Computational modeling; Costs; Fault diagnosis; Fault tolerance; Performance analysis; Performance evaluation; Redundancy; Resource management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Systems, 1994. International Conference on
  • Conference_Location
    Hsinchu
  • Print_ISBN
    0-8186-6555-6
  • Type

    conf

  • DOI
    10.1109/ICPADS.1994.590322
  • Filename
    590322