Title :
Low overhead distributed diagnostic algorithms for very large multiple processor systems
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Wisconsin Univ., Milwaukee, WI, USA
Abstract :
The increasing need for the design of high-performance, highly reliable systems has led to the design of very large systems made up of hundreds of thousands of processors. The author proposes distributed algorithms for testing and reconfiguration of these systems. In these algorithms the number of tests and the amount of testing message overhead are reduced by making testing assignment dynamic. Initially a small number of processors, ideally one, is assigned to test every processor, and when some of the processor or communication channels fail, a new testing assignment is made to assign again a small number of testers to every processor.<>
Keywords :
distributed processing; fault tolerant computing; multiprocessing systems; low overhead distributed diagnostic algorithms; testing assignment; very large multiple processor systems; Automatic testing; Communication channels; Concurrent computing; Fault diagnosis; Performance evaluation; Search problems; System performance; System testing; Very large scale integration;
Conference_Titel :
Computers and Communications, 1989. Conference Proceedings., Eighth Annual International Phoenix Conference on
Conference_Location :
Scottsdale, AZ, USA
Print_ISBN :
0-8186-1918-x
DOI :
10.1109/PCCC.1989.37438