DocumentCode :
2609807
Title :
The NetLogger methodology for high performance distributed systems performance analysis
Author :
Tierney, Brian ; Johnston, William ; Crowley, Brian ; Hoo, Gary ; Brooks, Chris ; Gunter, Dan
Author_Institution :
Lawrence Berkeley Nat. Lab., California Univ., Berkeley, CA, USA
fYear :
1998
fDate :
28-31 Jul 1998
Firstpage :
260
Lastpage :
267
Abstract :
We describe a methodology that enables the real-time diagnosis of performance problems in complex high-performance distributed systems. The methodology includes tools for generating precision event logs that can be used to provide detailed end-to-end application and system level monitoring; a Java agent-based system for managing the large amount of logging data; and tools for visualizing the log data and real-time state of the distributed system. We developed these tools for analyzing a high-performance distributed system centered around the transfer of large amounts of data at high speeds from a distributed storage server to a remote visualization client. However this methodology should be generally applicable to any distributed system. This methodology called NetLogger has proven invaluable for diagnosing problems in networks and in distributed systems code. This approach is novel in that it combines network, host, and application-level monitoring, providing a complete view of the entire system
Keywords :
client-server systems; computer networks; data visualisation; object-oriented languages; real-time systems; software performance evaluation; system monitoring; Java; NetLogger methodology; agent-based system; application-level monitoring; distributed storage server; high performance distributed systems; host monitoring; log data visualization; logging data management; network monitoring; performance analysis; precision event logs; real-time diagnosis; remote visualization client; system level monitoring; Data visualization; Delay; Distributed computing; High performance computing; Laboratories; Monitoring; Operating systems; Performance analysis; System performance; Throughput;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Distributed Computing, 1998. Proceedings. The Seventh International Symposium on
Conference_Location :
Chicago, IL
ISSN :
1082-8907
Print_ISBN :
0-8186-8579-4
Type :
conf
DOI :
10.1109/HPDC.1998.709980
Filename :
709980
Link To Document :
بازگشت