Title :
An improved on-line monitoring technique for a fault-tolerant computing node
Author :
Elahresh, Musbah ; Djordjevic, Jovan ; Tomasevic, Milo ; Aleksic, Milivoje
Author_Institution :
Fac. of Electr. Eng., Belgrade Univ., Serbia
Abstract :
A fault-tolerant computing node is an indispensable component of reliable distributed computer systems designed for life-critical applications. On-line monitoring is frequently used for error detection in such systems; it relies on an external hardware monitor connected to the system bus. The monitor performs control-flow checking based on signatures assigned to each block of the application. However, this technique cannot be used with contemporary processors that have a built-in cache. This paper therefore proposes an improved on-line monitoring technique that overcomes this problem.
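The signature-based control-flow checking mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' design: the signature values, the `CFG` table, and the names `SignatureMonitor` and `first_error` are hypothetical, and the sketch assumes the monitor can observe one signature per executed block and knows the legal successors of each block.

```python
from typing import Dict, Iterable, Optional, Set

# Hypothetical control flow graph: each block signature maps to the
# signatures of the blocks that may legally execute next.
CFG: Dict[int, Set[int]] = {
    0xA1: {0xB2, 0xC3},  # entry block branches to one of two blocks
    0xB2: {0xD4},
    0xC3: {0xD4},
    0xD4: set(),         # exit block has no successors
}


class SignatureMonitor:
    """Software model of a monitor that checks observed block signatures."""

    def __init__(self, cfg: Dict[int, Set[int]], entry: int) -> None:
        self.cfg = cfg
        self.entry = entry
        self.current: Optional[int] = None  # last accepted signature

    def observe(self, signature: int) -> bool:
        """Return True for a legal transition, False when an error is detected."""
        if self.current is None:
            ok = signature == self.entry
        else:
            ok = signature in self.cfg.get(self.current, set())
        if ok:
            self.current = signature
        return ok


def first_error(monitor: SignatureMonitor, trace: Iterable[int]) -> Optional[int]:
    """Return the index of the first illegal signature in the trace, or None."""
    for index, signature in enumerate(trace):
        if not monitor.observe(signature):
            return index
    return None
```

For example, under these assumptions the trace `[0xA1, 0xB2, 0xD4]` is accepted, while `[0xA1, 0xD4]` is flagged at index 1 because the entry block cannot jump directly to the exit block.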
Keywords :
computerised monitoring; error detection; fault tolerant computing; flow graphs; safety-critical software; control flow graph; error detection; fault-tolerant computing node; life critical applications; on-line monitoring; reliable distributed computer systems; Application software; Computer errors; Computerized monitoring; Distributed computing; Error correction; Fault detection; Fault tolerance; Fault tolerant systems; Flow graphs; System buses;
Conference_Title :
Electrical and Computer Engineering, 2004. Canadian Conference on
Print_ISBN :
0-7803-8253-6
DOI :
10.1109/CCECE.2004.1349703