Title :
NetLogger: a toolkit for distributed system performance analysis
Author :
Gunter, Dan ; Tierney, Brian ; Crowley, Brian ; Holding, Mason ; Lee, Jason
Author_Institution :
Lawrence Berkeley Lab., Berkeley, CA, USA
Abstract :
Diagnosis and debugging of performance problems on complex distributed systems require end-to-end performance information at both the application and system level. We describe a methodology, called NetLogger, that enables real-time diagnosis of performance problems in such systems. The methodology includes tools for generating precision event logs, an interface to a system event-monitoring framework, and tools for visualizing the log data and real-time state of the distributed system. Low overhead is an important requirement for such tools; we therefore evaluate the efficiency of the monitoring itself. The approach is novel in that it combines network, host, and application-level monitoring, providing a complete view of the entire system.
Keywords :
distributed programming; program debugging; program diagnostics; software performance evaluation; system monitoring; NetLogger toolkit; distributed system performance analysis; end-to-end performance information; interface; log data visualisation; low overhead; performance problem debugging; performance problem diagnosis; precision event logs; real-time state visualisation; system event-monitoring framework; Condition monitoring; Data visualization; Debugging; Delay; Laboratories; Operating systems; Performance analysis; Real time systems; System performance; Throughput
Conference_Title :
Modeling, Analysis and Simulation of Computer and Telecommunication Systems, 2000. Proceedings. 8th International Symposium on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7695-0728-X
DOI :
10.1109/MASCOT.2000.876548