Title :
Coding approaches to fault tolerance in linear dynamic systems
Author :
Hadjicostis, Christoforos N. ; Verghese, George C.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Illinois, Urbana, IL, USA
Abstract :
This paper discusses fault tolerance in discrete-time dynamic systems, such as finite-state controllers or computer simulations, with focus on the use of coding techniques to efficiently provide fault tolerance to linear finite-state machines (LFSMs). Unlike traditional fault tolerance schemes, which rely heavily-particularly for dynamic systems operating over extended time horizons-on the assumption that the error-correcting mechanism is fault free, we are interested in the case when all components of the implementation are fault prone. The paper starts with a paradigmatic fault tolerance scheme that systematically adds redundancy into a discrete-time dynamic system in a way that achieves tolerance to transient faults in both the state transition and the error-correcting mechanisms. By combining this methodology with low-complexity error-correcting coding, we then obtain an efficient way of providing fault tolerance to k identical unreliable LFSMs that operate in parallel on distinct input sequences. The overall construction requires only a constant amount of redundant hardware per machine (but sufficiently large k) to achieve an arbitrarily small probability of overall failure for any prespecified (finite) time interval, leading in this way to a lower bound on the computational capacity of unreliable LFSMs.
Keywords :
discrete systems; error correction codes; fault tolerance; fault trees; finite state machines; linear systems; redundancy; LFSM; discrete-time dynamic system; error-correcting coding; fault free; linear finite-state machines; paradigmatic fault tolerance; redundant hardware per machine; state transition; Circuit faults; Computer errors; Computer simulation; Electromagnetic interference; Electromagnetic transients; Fault tolerance; Fault tolerant systems; Hardware; Protection; Redundancy;
Journal_Title :
Information Theory, IEEE Transactions on
DOI :
10.1109/TIT.2004.839491