Title :
Stateful Detection in High Throughput Distributed Systems
Author :
Khanna, Gunjan ; Laguna, Ignacio ; Arshad, Fahad A. ; Bagchi, Saurabh
Author_Institution :
Purdue Univ., West Lafayette
Abstract :
With the increasing speed of computers and the complexity of applications, many of today´s distributed systems exchange data at a high rate. Significant work has been done in error detection achieved through external fault tolerance systems. However, the high data rate coupled with complex detection can cause the capacity of the fault tolerance system to be exhausted resulting in low detection accuracy. We present a new stateful detection mechanism which observes the exchanged application messages, deduces the application state, and matches against anomaly-based rules. We extend our previous framework (the monitor) to incorporate a sampling approach which adjusts the rate of verified messages. The sampling approach avoids the previously reported breakdown in the monitor capacity at high application message rates, reduces the overall detection cost and allows the monitor to provide accurate detection. We apply the approach to a reliable multicast protocol (TRAM) and demonstrate its performance by comparing it with our previous framework.
Keywords :
computer network reliability; electronic messaging; error detection; fault tolerant computing; multicast protocols; sampling methods; telecommunication security; anomaly-based rule; fault tolerance system; high throughput distributed system; message exchange; reliable multicast protocol; sampling approach; stateful error detection; Application software; Computer errors; Costs; Distributed computing; Electric breakdown; Fault detection; Fault tolerant systems; Monitoring; Sampling methods; Throughput;
Conference_Titel :
Reliable Distributed Systems, 2007. SRDS 2007. 26th IEEE International Symposium on
Conference_Location :
Beijing
Print_ISBN :
0-7695-2995-X
DOI :
10.1109/SRDS.2007.15